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Abstract 



This thesis describes a numerical study of binary boson stars within the context of an approxi- 
mation to general relativity. Boson stars, which are static, gravitationally bound configurations of 
a massive complex scalar field, can be made gravitationally compact. Astrophysically, the study 
of gravitationally compact binaries — in which each constituent is either a neutron star or a black 
hole — and especially the merger of the constituents that generically results from gravitational wave 
emission, continues to be of great interest. Such mergers are among the most energetic phenomena 
thought to occur in our universe. They typically emit copious amounts of gravitational radiation, 
and are thus excellent candidates for early detection by current and future gravitational wave 
experiments. 

The approximation we adopt places certain restrictions on the dynamical variables of general 
relativity (conformal flatness of the 3- metric) , and on the time-slicing of the spacctime (maximal 
slicing), and has been previously used in the simulation of neutron stars mergers. The resulting 
modeling problem requires the solution of a coupled nonlinear system of 4 hyperbolic, and 5 elliptic 
partial differential equations (PDEs) in three space dimensions and time. We approximately solve 
this system as an initial-boundary value problem, using finite difference techniques and well known, 
computationally efficient numerical algorithms such as the multigrid method in the case of the 
elliptic equations. Careful attention is paid to the issue of code validation, and a key part of the 
thesis is the demonstration that, as the basic scale of finite difference discretization is reduced, our 
numerical code generates results that converge to a solution of the continuum system of PDEs as 
desired. 

The thesis concludes with a discussion of results from some initial explorations of the orbital 
dynamics of boson star binaries. In particular, we describe calculations in which motion of such a 
binary is followed for more than two orbital periods, which is a significant advance over previous 
studies. We also present results from computations in which the boson stars merge, and where 
there is evidence for black hole formation. 
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Chapter 1 
Introduction 



One of the most revolutionary predictions of general relativity — Einstein's spectacularly successful 
theory of the gravitational interaction — is the existence of gravitational waves. Binary systems 
composed of gravitationally compact objects Q such as two black holes, two neutron stars or one 
neutron star and one black hole, are among the most promising sources of these waves. According 
to Einstein's theory, such binaries have orbits which decay due to the emission of gravitational 
radiation. This decay is thought to ultimately result in a violent "plunge and merger" of the two 
objects, characterized by an extremely strong and dynamical gravitational field, and the emission 
of an especially strong burst of gravitational radiation. An international network of laser interfer- 
ometer detectors (e.g. LIGO [I], VIRGO 12], TAMA300 and GEO600 g]) has been constructed 
to detect these bursts (as well as other sources), where the detection sensitivity is best for compact 
objects having masses of the order 1 to 10 times that of the sun. However, even for such strong 
sources, the terrestrial signal strength from a typical event (not likely in our own galaxy, or even in 
our local group of galaxies, on the time scale of years) is expected to be much less than the inherent 
noise in the instruments. Faced with this situation, the most promising technique to extract the 
signals from the noise is matched filtering [5], which involves comparison of the measured signal 
against a known (precomputed) family of waveforms. Construction of such template waveforms is 
thus an urgent problem, and it requires accurate theoretical modelling of the process of compact 
binary inspiral. 

Due to the complexity and nonlinearity of the Einstein field equations for the relativistic grav- 
itational field — to which one must add the governing equations for any matter fields which are 
involved (such as a perfect fluid in the case of neutron stars) — accurate modelling of the late 
phases of binary inspiral and merger requires a numerical approach. Now, as we will discuss in 
more detail shortly fScc. [TTT|) . the computer simulation of strongly interacting compact binaries 
has seen tremendous advances over the past few years. This is especially true for the case of the 
inspiral and collision of two black holes, which although a formidable problem, is simplified relative 
to its neutron star counterpart by the fact that it docs not involve matter fields. For neutron stars, 
the need to solve the equations of general relativistic hydrodynamics in concert with the Einstein 
equations leads to a host of other difliculties, including the need to deal with shocks, turbulence 
and uncertainties concerning the equation of state. Moreover, the parameter space describing the 
generic collision of two compact object has many dimensions, so that even with improved sim- 
ulation techniques, identifying and extracting the key physics from the calculations will present 
a huge computational challenge for years to come. Here wc should emphasize that calculations 
of interacting binaries must be done in three spatial dimensions and time, so that even a single 
calculation that is adequately resolved requires the use of high performance computing facilities. 

Given this state of affairs there is considerable motivation to look for simplified "toy models" , 
with the hope and expectation that they can provide insight into aspects of the compact binary 
problem, especially for the neutron star case. We have already noted that the plunge and merger 
phase of neutron stars in inspiral is characterized by a strong and dynamical gravitational field. 
At least heuristically, the dynamics is dominated by the bulk motion of the two stars, so that 
localized features in the matter — such as individual shocks, or small-scale turbulence — should have 
relatively little impact on the overall dynamics, or on the gravitational radiation which is produced. 

^An object with mass, M, is gravitationally compact if its radius, R, is close to its Schwarzschild radius, Rg, 
defined by Rg = IGMjc?, where G is Newton's gravitational constant, and c is the speed of light. For (spherical) 
black holes R = Rs-, whereas for neutron stars Rg/R is typically in the range 0.04-0.27. 
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Wc thus look for a matter model which can describe gravitationally compact objects, but where the 
equations of motion for the matter are easier to treat computationally than those for a relativistic 
fluid. 

The studies in this thesis involve precisely such a matter model. Specifically, we adopt a single, 
massive, complex scalar field as a matter source. The model admits localized, time- independent, 
gravitationally-bound configurations known as boson stars, which, through an appropriate choice 
of parameters, can be made compact. We can also set up initial data representing two stars, with 
subsequent evolution describing a variety of different kinds of encounters. The equation of motion 
for the scalar field is simply the general relativistic wave equation, or Klein-Gordon equation, whose 
numerical solution using finite difference techniques is quite straightforward. 

Moreover, in this thesis we take the "toy model" approach one step further. Although it is 
certainly possible to simulate boson stars in a fully general relativistic setting [6l [71 IH IH [10] , we 
opt to study them within the context of a relatively simple approximation to general relativity 
which has previously been used to study strong-gravity effects in several scenarios of astrophysical 
interest, including the interaction of neutron stars. 

More specifically, and as will be discussed in detail in Chap. [U this approximation is based on 
the conformally flat condition, or CFC, for the spatial part of the metric (which is the fundamental 
dynamical variable that describes the gravitational field) , along with the maximal slicing condition, 
which fixes the time coordinatization of the spacetime. Although the literature has traditionally 
used the acronym CFC for the resulting approximation to Einsteinian gravity, we prefer to use CPA, 
for conformally flat approximation. This stresses the fact that we are dealing with an approximation 
to general relativity. Also, we emphasize that although neither acronym makes explicit reference 
to the maximal slicing condition, that choice of time coordinatization has always been an essential 
ingredient of the approximation, and is so here. 

The CPA is based on the heuristic assumption that the dynamical degrees of freedom of the 
gravitational field, i.e. those associated with gravitational radiation, play a small role in at least 
some phases of the strong field interaction of a merging binary, and on the related fact that the 
amount of energy contained in the waves, expressed as a fraction of the total mass-energy of the 
system, tends to be small. The CPA effectively eliminates the two dynamical degrees of freedom 
present in general relativity, but still allows for investigation of the same kinds of phenomena ob- 
served in the full general relativistic case. These include the description of compact objects, the 
dynamics of their interaction, and black hole formation. It is also worth mentioning that it is still 
possible to study gravitational wave generation within this approximation via a perturbative mul- 
tipole expansion of the metric components. Briefly, the incorporation of radiation effects, although 
far from a trivial matter, can be realized through the introduction of a radiation reaction potential 
in the equations of motion for the matter model. We also note that for spherically symmetric sys- 
tems the CPA is not an approximation, but can always be adopted through an appropriate choice 
of coordinates. Furthermore, use of the approximation for axially symmetric problems has indi- 
cated that the results obtained mimic those of general relativity quite accurately [11] [T21 [El [M] . 
There is thus considerable motivation to perform additional studies of strong-field gravity using 
this approach. 

The CPA was first studied in a theoretical/mathematical context by Isenberg in the 1970's 
[I5]. and applied numerically (and independently) by Wilson et al. [T6l[T7l[T8l[T9l[20l[2Tl[22l[23] 
in the 1990's to the study of coalescing neutron-star binaries. In this prior work, Wilson and 
his collaborators presented evidence that, for a realistic neutron-star equation of state, general 
relativistic effects might cause the stars to individually collapse to black holes prior to merging. 
Furthermore, they observed that at least for some set of initial orbital parameters, strong-field 
effects caused the last innermost stable circular orbit, or ISCO, to occur at a larger separation 
distance, and thus at lower frequency, than was previously estimated by post-Newtonian methods. 
This result had significance for the possible detection of gravitational waves, since it placed the 
frequency of radiation from coalescence closer to the maximum sensitivity range of current laser- 
interferometric detectors. However, the Wilson-Mathews compression effect was unexpected and 
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controversial, and raised questions concerning the validity of the CFA. 

Subsequently, Flanagan [23] identified an inconsistency in the derivation of some of the equations 
of motion used in this study, and suggested that use of the correct equations would reduce the 
crushing effect. A revised version of simulations was published shortly thereafter [2S|: a key claim 
resulting from this work was that the crushing effect was still present, although the magnitude of 
the observed effect was reduced relative to the previous calculations. Further comparisons between 
a fully relativistic code and its CFA counterpart in the context of head-on collisions of neutron 
stars showed the presence of this effect and lent more credibility to the earlier calculations [26] . 
However, the most recent simulations [27] aimed at studying the possible crushing phenomenon 
actually indicate a decompressing effect on the neutron stars. Still, this result is not in direct 
contradiction to Wilson et al's results since the initial data used for the two sets of simulations 
differ. For a more complete review of the history of this controversy, as well as a possible explanation 
for the neutron star crushing effect, the reader should refer to the work by Favata [28] and references 
therein. We should also note at this point that since this thesis work was started, new "waveless" 
formalisms have been developed [521 130] , and have been used to improve the accuracy of certain 
compact binary calculations [31] in the inspiral phase, relative to fully relativistic computations. 

An ultimate goal of the work started in this thesis is to determine to what extent the CFA is 
a good approximation for the modelling of general compact binaries. From this point of view, it is 
particularly interesting to study the CFA within the context of a simpler matter model than that 
previously adopted, and this provides an important motivation for our use of boson stars. Some of 
the main questions we wished to address are as follows: 

• Would boson stars collapse individually before merger, or is this phenomena strongly depen- 
dent on how the matter is modelled? Favata j28j posited a mechanism that would tend to 
compress the neutron stars given a particular set of conditions. Does that analysis apply to 
the binary boson star as well? 

• How well does the CFA approximate the full general relativistic equations? Can we shed some 
light on the nature of the radiative degrees of freedom in the full theory from a detailed study 
of the differences between results obtained using the CFA and the full Einstein equations? 

• What is the final result of the merger? Can we compare the results to those obtained from 
other techniques, including fully general relativistic calculations? 

• Where is the ISCO? Do the results obtained match those seen in collisions of fluid stars, or 
can they at least be compared qualitatively? 

In order to answer these and related questions, the Einstein equations (and ultimately the 
equations that result from adopting the CFA) have to be cast in a form suitable for numerical 
computation. What follows is a brief description of our modelling procedure; more details will be 
provided in subsequent chapters. 

The current work makes use of the 3 -I- 1 or ADM (Arnowitt-Deser-Misner) [32] decomposition 
of the Einstein equations. One of the main features of the 3 -I- I approach, which underlies the 
majority of the work in numerical relativity!^, is that it provides a prescription for disentangling the 
dynamical field variables from those associated with the coordinate invariance of general relativity. 
Specifically, the spacetime metric components are grouped into 4 kincmatical components which 
encode the coordinate freedom of the theory (the lapse function and three components of the shift 
vector), and 6 dynamical ones (the components of a 3-dimensional metric induced in a constant- 
time hypersurface). Now, for any specific calculation, the coordinate system must be completely 
fixed by giving prescriptions for the lapse and shift. In particular, the time coordinate is fixed by 
a choice of the lapse function, and, as already mentioned, in this work wc follow previous studies 



^Numerical relativity can be defined as the subficld of general relativity that is concerned with the solution of 
the Einstein equations, as well as the equations of motion for any matter sources, using computational methods. 
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using the CFA and adopt so-called maximal slicing (a very commonly adopted slicing condition in 
numerical relativity that was originally proposed by Lichnerowicz |33j). A key property of maximal 
slicing is that its use inhibits the focusing of the world lines of observers that move orthogonally to 
the hypersurfaces: such focusing can result in the development of coordinate singularities, and is 
also associated with the formation of physical singularities. Another important point is that this 
coordinate choice generally results in a well-posed elliptic equation for the lapse function. 

As the name suggests, the CFA requires the spatial 3-geometry to be conformally flat. This 
reduces the number of independent components of the spatial metric from 6 to 1, and the single, 
non-trivial function that then defines the spatial metric is called the conformal factor. Given 
the maximal slicing condition, it transpires that conformal flatness implies that all non-trivial 
components of the 4-mctric are governed by elliptic equations. 

Within the CFA, the dynamics of the gravitational field is completely determined by the dynam- 
ics of the matter source(s), which in our case is a complex scalar field that satisfies the Klein-Gordon 
equation. Overall then, the model considered here — which we will hereafter frequently refer to as 
"our model" — is governed by a mixed system of 4 first-order hyperbolic partial differential equa- 
tions (PDEs), coming from the complex Klein-Gordon equation, and 5 elliptic PDEs. We treat this 
system as an initial-boundary value problem and the thesis focuses on the development, testing 
and preliminary use of a code for its numerical solution. Our computational approach is based 
on finite differencing [34], and we use approximations that are second-order accurate in the grid 
spacing. 

A numerical evolution of our model starts with the specification of initial conditions for the 
complex scalar field. For example, for the relatively simple case of two identical boson stars that are 
initially at rest, the solution of the coupled Einstein/scalar field equations in spherical symmetry, 
and with a time-periodic ansatz for the scalar field, provides a static star profile that can be 
duplicated and interpolated onto a three dimensional domain. This sets initial values representing 
a binary system. Discrete versions of the elliptic equations governing the geometric quantities are 
then solved at the initial time using the multigrid method [35]. The matter field values are then 
time advanced using a second order discretization of the Klein-Gordon equation, which, along with 
the elliptic equations for the advanced-time metric unknowns, is solved iteratively. 

Our code has been subjected to a thorough series of tests that assess convergence of the nu- 
merical results with respect to the basic discretization scale. Additional evidence for code validity 
involves the use of conserved quantities, as well as the technique of independent residual evaluation 
(which is defined and discussed in Chap. |4]). Overall, results from these tests indicate that the 
code is correctly solving the equations of motion that define the model. Specific results calculated 
using the code include long-term evolution of an orbiting binary system, (more than two orbital 
periods), as well as high speed head-on collisions. These results are promising and suggest that, 
especially with enhancements such as the incorporation of adaptive mesh refinement and capabili- 
ties for parallel execution, the code will be a powerful tool for investigating strong-gravity effects 
in the interaction of boson stars. 

We now proceed to two short overviews of subjects that are germane to this work. First, in 
Sec. II. f[ the current status of the numerical modelling of gravitationally-compact binary systems 
is summarized. Second, some of the key previous studies of boson stars are reviewed in Sec. 11.21 

1.1 Overview of Numerical Simulations of Compact 
Binaries 

1.1.1 Black Hole Binaries 

As mentioned previously, accurate modelling of compact star binaries is needed to help extract 
the gravitational wave signal from the noise in current and planned terrestrial gravitational wave 
interferometers. This section briefly describes the status of numerical relativistic modelling of these 
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binaries by means of a quick survey of the pertinent literature. We first note that there are several 
good review papers on the general subject, including the following: 

• Rasio & Shapiro's review of coalescing binary neutron stars [36]. This covers work up to 1999 
on coalescing binary neutron stars — using Newtonian, post-Newtonian and semi-relativistic 
approximations — and is highly recommended for familiarizing the interested reader with ear- 
lier research in the field. 

• Baumgarte & Shapiro's extensive review of the use of numerical relativity to model compact 
binaries [37] . 

• Prctorius' more recent review on black hole collisions |38| . 

Efforts to accurately model the collision of compact objects using fully general relativistic 
calculations began with the work on black hole head-on collisions by Smarr and collaborators 
in the 1970's [39]. These calculations established numerical relativity as a sub-field of general 
relativity in its own right, and indicated that one could expect perhaps a few percent of the total 
mass-energy of the system to be emitted in gravitational waves. In the early 1990's, Anninos et 
al. [3^ revisited and extended the head-on calculations. A major result from this effort was the 
demonstration that the extracted gravitational waveform could be well-matched to (perturbative) 
black hole normal modes. In 1993 the Binary Black Hole Grand Challenge Alliance, involving many 
investigators from several different institutions in the US, was founded with the mission to provide 
stable, convergent algorithms to compute the gravitational waveforms from black-hole collisions. 
Although this research spurred the development of computational infrastructure that is still being 
used by the numerical relativity community, the project ended well short of its stated physics goals. 
The most important simulations that resulted from the 5-year effort described the propagation of a 
single black hole through a three dimensional computational domain [41]. Nonetheless, this result 
constituted an important step towards the solution of the binary black hole problem, especially 
when contrasted with previous computations where the black-holes were kept fixed in the domain 
by specific choices of coordinates. It was not until 2004 that a significant amount of orbital motion 
for a black hole binary was successfully simulated, when Briigmann and collaborators published 
results [42] from computations of a full orbital period for a widely-separated black hole pair. 

2005 then saw the publication of breakthrough results by Pretorius [43l [44] , and the beginning 
of the most productive period of binary black hole research. Pretorius presented calculations that 
tracked the evolution of a binary black hole spacetime through several orbits, the plunge and merger 
phase, and into the late-time stage where the final black hole quickly rings down to a near-stationary 
(Kerr) configuration. Furthermore, Pretorius was able to extract a reasonably accurate gravita- 
tional wave signal from the whole evolution. Pretorius' work had an electrifying effect on research in 
the field, and other groups, most notably the University of Texas at Brownsville (UTB) [45 l l46 l l47] 
and NASA/Goddard [351131] teams, were quickly able to produce comparable results using similar 
[50] or different techniques (moving punctures) 311 [49l |45l [461 HT] [51] [52] • Moreover, it was possible 
to compare the various simulations and to show that the results were generally consistent to within 
the level of numerical error [53]. 

Since that time, a large number of distinct simulations have been performed, and some in- 
teresting phenomenology has been unearthed. Particularly noteworthy are the "kicks" that the 
final black holes formed from mergers can experience due to asymmetric emission of gravitational 
radiation during the coalescence [54] [55l (56] [57]. When the merger process is itself sufficiently 
asymmetric — as is the case for the inspiralling collision of two holes with unequal masses — the 
gravitational waves produced carry away net linear momentum. This imparts an equal and op- 
posite momentum, or kick, to the final black hole, and the magnitude of the resulting recoil can 
have astrophysical implications. For example, depending on the mass of the stellar cluster in which 
the binary is embedded, the merged black hole may actually be ejected from the cluster or even 
the host galaxy. Early calculations concerning these kicks — which used non-spinning black holes 
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(no intrinsic angular momentum) — have now been extended to investigate the effects of the black 
hole spins on the recoil velocity of the final black hole [551 [SH ISOl Ell IHH [S3] ■ These studies have 
indicated that impressively large kicks (~ 4000km/s) can result from misalignment of the black 
holes' intrinsic spins and the orbital angular momentum, as well as from large initial spins. In 
addition, pioneering studies by the UTB/Rochester Institute of Technology group [Ml [551 IBSl 157] 
have shown that the black holes' initial spins can have a significant impact on the orbital dynamics, 
as well as on the emitted gravitational waveform at late times. 

Another interesting effect that is still being studied concerns the effect of orbital eccentricity on 
the gravitational radiation waveforms; summaries of recent progress on this problem are given in 
[ill [551 [Sni [701 [7T] . Moreover, comparison of the waveforms resulting from numerical simulations on 
the one hand, and post-Newtonian calculations on the other, is also a rapidly developing industry 
[72l[73l[74l[75l[76]. Especially notable in this regard are the very high accuracy simulations, recently 
reported by the Cornell-Caltech group, that use pseudo-spectral techniques [TT] [78]. Gravitational 
waveforms from some of these studies have now been compared to those calculated from post- 
Newtonian techniques [79] . These comparisons provide conclusive evidence that the best numerical 
results can now provide more accurate waveforms for the late phases of inspiral than those computed 
using post-Newtonian methods. 

Finally, it is worth mentioning two studies of relativistic (high speed) black hole collisions 
[80|, I81j . This type of interaction is of particular current interest since it ties in with the phe- 
nomenology of possible mini-black hole formation in new generations of particle accelerators such 
as the Large Hadron Collider at CERN. Moreover, ultrarelativistic black-hole collisions represent 
largely unexplored territory where fundamental theoretical issues might be addressed. These in- 
clude the possibilities of naked singularity formation, as well as the production of an end state 
containing three or more black holes, starting from a two black hole initial state. Although the 
preliminary results show no evidence for naked singularities, they do provide explicit demonstration 
of the copious amount of gravitational radiation that can be emitted from such encounters. Specif- 
ically, in some of the cases simulated, as much as 25% of the initial rest-mass energy is emitted as 
gravitational waves. This approaches the upper bound of 29% estimated by Penrose [82[ [83] . 

1.1.2 Neutron Star Binaries 

The first fully relativistic simulation of binary neutron star inspiral and merger was performed by 
Shibata and Uryii in 2000 |84| . Adopting a simple equation of state, they examined the issue of 
the nature of the end state from the coalescence of two equal-mass stars. Perhaps not surprisingly, 
their work indicated that the final configuration depended on the compactness of the initial neutron 
stars: more compact stars would lead to black hole formation, while those less condensed would 
merge to form a high-mass, differentially rotating neutron star. In 2002, subsequent work by 
the same authors, using increased resolution in the simulations, produced gravitational waveform 
estimates for the mergers [85] . Additional research by Shibata and his collaborators (8^ has focused 
on unequal mass binaries, as well as on the use of several different approaches to mimic realistic 
equations of state [ST] [88] . 

Work by other groups has followed, and there have now been several reports of stable long term 
simulations that extend over a few inspiraling orbits through to merger (the end state is typically a 
black hole) [89l [90] [H] [92] . It is worth noting that in all of the work mentioned this far, effects due 
to magnetic fields were neglected. However, within the past year or so, preliminary results involving 
magnetized neutron star binaries have started to appear |93i I94| . Finally, three dimensional general 
relativistic simulations of mixed binaries — where one constituent is a black hole and the other is a 
neutron-star — are also underway; see (95] [Ml [HI [98] [99] and references therein. 
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1.2 An Overview of Boson Stars 

The concept of a boson star can be traced back to work by Wheeler and collaborators |100| llOlj who 
considered self-gravitating "lumps" of massless fields (electromagnetic or gravitational) , which they 
dubbed geons. Unfortunately such configurations were soon found to be unstable and research on 
them quickly subsided. However, the geon idea provided impetus for the investigation of scalar fields 
in the context of general relativity. The earliest published work of direct relevance to our study is 
due to Kaup |102| , who pioneered the study of self-gravitating configurations of a massive complex 
scalar field, governed by the coupled system of Einstein and Klein-Gordon (EKG) equations. The 
main result of his work, which assumed spherical symmetry and a harmonic ansatz for the time 
dependence of the scalar field, was to determine the equilibrium states of the system. These are 
precisely the states that we now know as boson stars (Kaup referred to them as "scalar geons"). He 
emphasized the similarities with neutron (fluid) stars, and established the conditions under which 
the stars would be stable (or unstable) to nonadiabatic radial perturbations. Indeed, Kaup was 
primarily interested in the issue of stability, and how resistant to gravitational collapse the massive 
complex scalar field was. 

At about the same time, Ruffini and Bonazzola |103] investigated the same spherically sym- 
metric system, and were also able to determine the equilibrium boson star solutions. However, 
in contrast to Kaup, these authors viewed the model as representing a semi-classical approach in 
which the Klein-Gordon field was fundamentally quantum mechanical. From considerations that 
parallel the calculations performed by Chandrasckhar for fluid stars |104j , one of their main results 
was the notion of a critical mass for boson stars; that is a mass above which the pressure support 
from the star (supposedly arising from the Heisenberg uncertainly principle) would be insufficient 
to resist gravitational collapse. They showed that the critical mass for bosonic matter scaled dif- 
ferently with the particle mass, m, than it did for fermionic (neutron star) matter, and that for 
any reasonable values of m, the resulting sizes and mass of the boson stars would be so small as to 
be astrophysically inconsequential. 

However, many years later, a seminal paper by Colpi and collaborators |105| showed that if 
the scalar field was endowed with a nonlinear self-interaction potential, then by suitable choice 
of the coupling constants appearing in the nonlinear terms, boson stars with masses comparable 
to neutron stars could be achieved using a plausible value for m. Specifically, for the case of a 
A|0|^ potential, and for sufficiently large values of the dimensionlcss coupling constant El, A = 
Xhc/AirGm^, the structure of the static solutions was found to change significantly relative to the 
stars with A = that had originally been studied. For large A the stellar density profile exhibits 
a relatively slow decline out to some coupling-dependent characteristic radius, and this leads to a 
significant enhancement of the star's mass. In fact, it was demonstrated in |105| that as A — >■ oo the 
maximum stellar mass has the same scaling dependence on the particle mass, m, as for fermionic 
stars. Although this result led to revived interest in astrophysical implications of boson stars — 
including the possibility that they might constitute at least some fraction of the dark matter that 
is currently unaccounted for — we must emphasize that no fundamental scalar boson has yet been 
detected, nor is there any credible evidence for the existence of boson stars. 

The work by Colpi et. al. was followed by a burst of activity in boson star research, with several 
groups carrying out investigations focused mainly on their stability properties jl061 11071 11081 1109[ 
11101 1111] . In 1991, Jetzer published a comprehensive review of the subject jll2j . and this remains 
a very useful general reference, especially for newcomers, as does the review by Lee and Pang |113) . 

What one might identify as the modern era of boson star research began with work by Seidel 
and Suen |114| . These authors considered the dynamical evolution of the full spherically-symmetric 
Einstein-Klein-Gordon equations, and were thus able to directly address issues such as dynamical 
stability. Their numerical experiments indicated that the stars, parametrized by the central mod- 
ulus, 00, of the scalar field, belonged to two "branches": one stable, the other unstable. Further, 



''Colpi and collaborators adopt c = h = 1 . 
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and in accord with analogous fluid-star calculations, as well as perturbation theory, the change 
in stability was found to coincide with the boson star that had maximal mass. When perturbed, 
stars that were on the stable branch were found to oscillate and radiate scalar radiation with a 
particular frequency, eventually settling down to a somewhat less massive stable star. On the other 
hand, stars on the unstable branch that were perturbed would either undergo gravitational collapse 
to a black hole or radiate some scalar field and migrate to a configuration on the stable branch 
with smaller mass. Later, in collaboration with Balakrishna, Seidel and Suen |115| extended these 
studies to include excited states and boson stars with nonlinear self-interactions. The dynamics 
of spherically symmetric boson stars have also been studied by other authors, including Choptuik 
and collaborators |1161 1117j , who have investigated the nature of critical gravitational collapse |118j 
in this context. 

In addition to the spherically symmetric work, the past decade or so has seen the accumulation 
of a considerable literature on axially symmetric boson stars. Since most of this work is not closely 
related to ours, we direct the interested reader to Schunck and Mielke |119j . which remains the 
most recent comprehensive review of boson stars. 

Finally, there have been a few previous efforts in which the dynamical evolution of boson stars 
has been studied in three spatial dimensions. This includes work by Balakrishna et. al. [7] and 
Palenzuela et. al. which, in both cases, involved the solution of the full Einstein equations. 

1.3 Summary of Thesis 

The remainder of this thesis is organized as follows: 

Chap. [2] discusses the principal formalisms and (continuum) approximations used in this thesis. 
A quick overview of general relativity is given, primarily to fix notation, and to introduce key 
geometrical quantities (metric, curvature, etc.). This is followed by a review of the 3 + 1 decom- 
position of spacetime, in which the 4-geometry of spacetime is viewed as the time history of the 
3-geometry of a spacelikc hypersurface. We then proceed to a description of our matter model (a 
massive complex scalar field) which includes a derivation of the associated equations of motion for 
the scalar field. This is followed by a section devoted to a general conformal decomposition of the 
3 -f- 1 equations. The conformally fiat approximation is then introduced, and the final form of the 
equations of motion for our model is derived. Related issues such as the boundary conditions and 
the ADM mass are also discussed. 

Chap. [3] is concerned with the generation of initial data for our model. We first give a de- 
scription of the construction of spherically symmetric, ground state boson stars. We then detail 
the procedures we used to interpolate one of two stars onto the three dimensional computational 
domain, and to provide the stars with initial velocities through the application of approximate 
Lorcntz boosts. 

Chap. |4] begins with a review of some basic concepts related to the finite difference approxima- 
tion (FDA) of partial differential equations (PDEs) . This is followed by a description of a simple 
example that illustrates the specific scheme we used to discretize our hyperbolic PDEs. We then 
discuss the strategies and techniques we use to establish the validity of our code and to assess 
the quality of the numerical results. The chapter concludes with a brief overview of the multigrid 
method in general, as well as a description of the specific multigrid algorithm we implemented to 
solve our discretized elliptic PDEs. 

The majority of the original research contributions in this thesis are contained in Chap. [5l The 
chapter starts with a discussion of the construction of the numerical code, including a description of 
the overall algorithmic flow. This is followed by the results from a thorough series of code tests that 
employed several different types of initial data. The dynamics of head-on collisions of boson stars 

^The boson stars discussed thus far, and which constitute the focus of this thesis, are so-called ground state 
boson star, where the spherically-symmetric scalar field profile is nodeless. The excited states have one or more 
nodes (zero-crossings). 
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is then considered. The last part of the chapter discusses resuUs related to orbital dynamics. We 
perform a modest parameter space survey and identify three distinct endstates for the calculations: 
1) long term orbital motion, 2) merger to a conjectured rotating and pulsating boson star, and 3) 
merger to a conjectured black hole. 

App. El documents the use of a publicly-available FORTRAN subroutine that generates initial 
data for a spherically-symmetric boson star, and which implements a general polynomial self- 
interaction term for the scalar field. We feel that this routine should be of significant utility to 
others who wish to study the dynamics of general relativistic boson stars. 

App. [B] lists all of the finite difference formulae we have used in the thesis and includes a 
note on the rcgularization of differential operators in curvilinear coordinate systems prior to finite- 
differencing. 

App. [C] contains brief documentation of a set of Maple procedures which facilitate the construc- 
tion and coding of finite-difference discretizations of generic systems of PDEs. Again, some effort 
has been expended in designing the routine so that it is potentially useful in other contexts, and 
for other researchers. 

Finally, App. |D] contains a brief review of the relaxation techniques that we have used in this 
work. 

1.4 Conventions, Notation and Units 

We adopt the abstract index notation for tensors as defined and discussed in Wald [120] . In par- 
ticular, letters from the beginning of the Latin alphabet, {a, b, c, ...}, denote abstract indices. We 
then use two sets of indices for tensor components: Greek indices {/x, ...} range over the spacetime 
values 0, 1, 2, 3 (where is the time index), while the subset of Latin indices {«, j, k, I, m, n} range 
over the spatial values 1, 2, 3. The Einstein summation convention applies to both of these compo- 
nent index types. We also adopt Wald's sign conventions, so, in particular, the metric signature is 
(-,+,+,+). 

The totally symmetric and totally antisymmetric parts of a tensor of type (0, 2) arc defined by 

T(ab) = -^^Tab +Tba) , (1-1) 

and ^ 

T[ab] = -^{Tab - Tba) , (1-2) 

respectively. 

Additionally, we use a common terminology from computational science in which the sets of 
time-dependent PDEs with dependence on 1, 2 and 3 spatial dimensions (independent variables) are 
referred to as ID, 2D and 3D, respectively. In particular, PDEs describing spherically symmetric 
systems in which the field variables depend on time and a radial coordinate, r, are referred to as 
ID. 

A variety of differentiation operators are used below. Va and Da denote the covariant deriva- 
tive operators compatible with the spacetime metric, gab, and induced hypersurface metric, 7ab, 
respectively. The Lie derivative along a vector field is denoted by Cy. Ordinary derivatives are 
represented by different notations according to the context. For example the ordinary derivative of 
the function / = f{x, y, z) with respect to the coordinate x will be denoted in one of the following 
ways: df/dx = dxf = f,x- For time dependent functions, g = g{t,x,y, z), we also sometimes de- 
note temporal differentiation using an overdot, so g = dg/dt = dtg. Similarly, for functions of one 
spatial variable, g = g{t, r) ox g = g{r), we sometimes use the prime notation for the first (spatial) 
derivative: g' = dg/dr or g' = dg/dr. Finally, a variant of geometric units is used throughout this 
thesis. Specifically, Newton's gravitational constant, G, the speed of light in vacuum, c, and the 
scalar field mass parameter, to, are all set to unity: G = c = to = 1. 
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Chapter 2 

Formalism and Equations of Motion 



As described in the introductory chapter, this thesis focuses on the numerical solution of a model 
that describes the gravitational interaction of boson stars within an approximation to Einstein's 
general theory of relativity. The bulk of this chapter is devoted to a discussion of the formalism 
underlying the model, and the derivation of the set of partial differential equations (PDEs) which 
govern it. We start with a brief review of the general relativistic field equations in Sec. 12.11 We 
then proceed to a discussion of the so called 3 + 1 (or ADM) formalism which underlies most of the 
computational work[f| in which systems of general relativistic partial differential equations are solved 
as initial value problems (or initial-boundary values problems). In Sec. 12.31 we discuss the matter 
content of our model, which is a massive complex scalar field. The equation of motion for the scalar 
field is derived, as are related quantities, including its stress-energy tensor. Sec. l2.4l then provides a 
definition of the approximation we make to the Einstein equations, as well as a detailed derivation 
of the PDEs that, using this approximation, then govern our model. The key element in the 
approximation is the demand that the 3-metric (which is the fundamental dynamical variable in the 
3-1-1 approach to general relativity) be conformal to a flat metric. In addition to this requirement, 
a specific choice of time coordinatization — known as maximal slicing — is made (Sec. I2.4.ip . This 
combination of an assumption of conformal flatness and a choice of maximal slicing leads to what 
we call the conformally flat approximation (or CFA) of general relativity. 

Principally for the sake of completeness. Sec. 12.4.21 contains a review of a speciflc approach to 
the conformal decomposition of the 3-1-1 form of the Einstein equations. Once this is in hand, 
it is a relatively straightforward matter to impose conformal flatness, adopt maximal slicing and 
then derive the equations of motion for our model (Sec. 12.4^ . Ultimately this results in a system 
of nonlinear PDEs of mixed elliptic-hyperbolic type that are written in Cartesian coordinates 
(Sec. I2.4.4|) . In Sec. 12.4.51 we discuss three different strategies that we have investigated for the 
implementation of boundary conditions for this system. This is followed by a deflnition of the 
ADM mass and a short description of our use of that quantity as a diagnostic in our computations 
(Sec. I2.4!6|) . Sec. I2.5l then concludes the chapter with a recap of the equations of motion, as well as 
a synopsis of how they were solved numerically. Thus, the reader interested only in the final form 
of the equations of motion may wish to proceed directly to that section. 

2.1 The Field Equations of General Relativity 

Space and time in general relativity are modelled as a four dimensional Lorentzian manifold, Ai. 
Each point in the spacetime manifold corresponds to a physical event. The notion of distance 
between two points or, in the spacetime case, the interval between two events, is encoded in a 
symmetric, non-degenerate, tensor field of type (0,2), the metric gab- The metric is thus the 
fundamental entity used to quantify the geometry of spacetime, and gravitational effects arise due 
to the fact that this geometry is, in general, curved. 

Let {x^^} be a coordinate system that we assume covers that part of the manifold which is 
of interest. The vector and dual vector bases associated with this coordinate system are called 
coordinate bases and arc usually written as {{d/dx'^)'^} and {{dx'^)^}, respectively. Therefore the 



^Here we note that an alternate formulation of Einstein's equations, known as the generalized harmonie approach, 
has also had a major impact on numerical relativity in the past few years. 
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metric components in this coordinate basis can be defined by: 

gab^ds^ ^g^,{dxf^)Jdxnf (2.1) 

The covariant derivative map, Va, is written in terms of the ordinary (partial) derivative map, 
da, via 

Vav' ^dav' +T\,V^, (2.2) 

where T^ac is the Christoffel symbol and is an arbitrary vector. Throughout this thesis we use a 
covariant derivative that is compatible with the metric, i.e. that satisfies V aQbc ~ 0. The ChristofTel 
symbol can then be calculated in terms of the ordinary derivative operator: 

T'ab = ig""^ [daMd + db9ad - ddOab) ■ (2.3) 

The intrinsic notion of curvature of the spacetime can be made mathematically precise by 
considering the parallel transport of vectors along curves in the spacetime. H For example, a 
spacetime is curved in a region if an arbitrary vector that is parallel transported along a closed 
path that bounds that region experiences an overall rotation. Another consequence of curvature is 
that the result of parallel transporting a vector is, in general, path dependent. In fact, the failure 
of successive applications of the covariant derivative to commute captures the path-dependence of 
parallel transport (along with the notion of curvature) and is quantified by the so called intrinsic 
curvature tensor, the Riemann curvature tensor, defined by: 

2V[a'^b]Wc^ Rab/Wd. (2.4) 

Contractions of the Riemann tensor give rise to the Ricci tensor: 

Rac = Radc"^ , (2.5) 

and to the Ricci scalar or scalar curvature: 

R = g'^'^Rab- (2.6) 

A key feature of the Riemann curvature tensor is that it satisfies the contracted Bianchi identity 
which can be expressed as the fact that the Einstein tensor, defined in terms of the Ricci tensor 
and scalar curvature as ^ 

Gab = Rab — -^QabR , (2.7) 

has vanishing divergence: 

V'Cb = 0. (2.8) 

Given a metric, gab, the Riemann curvature tensor can be written in terms of the Christoffel 
symbols through the following tensorial equation: 

Rab/ = -2 {dlaV^^c - r''c[ar'*6]e) ■ (2.9) 

Once a coordinate basis is chosen, the Riemann tensor components can be computed from the 
metric components and its various derivatives in that basis. In particular, since from p.3p the 
Christoffel symbol components involve first derivatives of the metric components, the components 
of the curvature tensor generally involve second derivatives of the metric. 

The curvature of spacetime arises in response to the distribution of matter-energy in the space- 
time, which includes contributions from the gravitational field itself, as well as those from any 
matter fields present in the spacetime. For any given matter field, the coupling to gravity is 



^See Chap. 3 of [120) for a complete and detailed discussion of this material. 
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described by the stress-energy-monientum tensor (stress-energy for short): Tab- In rnany cases, 
including that considered in this thesis, the equation of motion for the matter field can be derived 
from the vanishing of the divergence of the stress tensor, 

V^Tab = 0. (2.10) 

The Einstein field equations expresses the intimate relationship between the physical phenomena 
taking place in the spacetime and the curvature of the spacetime geometry itself. In their most 
compact and elegant form the field equations are: 

Gab = Rab - -j^QabR = uTab, (2-11) 

where the constant k, depends on the system of units, but is also chosen so that in the weak-field 
limit, the Newtonian description of gravity is recovered. In the geometric units used in this thesis 
we have k = Stt. 

Once a coordinate basis is chosen, the Einstein equations can be cast as a system of 10 nonlinear 
second order partial differential equations for the metric components, (7^1/, in the chosen coordinate 
system. Since the metric signature is Lorentzian, i.e. (— , +, +, +), this system of partial differential 
equations has a hyperbolic or wave-like character. 

Although there have been many "exact" (closed form) solutions of the Einstein equations dis- 
covered over the years, very few of these are considered to be of significant physical interest, and 
even fewer are thought to be relevant to astrophysics. Typically, exact solutions are obtained by 
demanding symmetries: the spherically symmetric Schwarzschild solution for a single black hole, 
as well as the Kerr solution, which is axially symmetric and generalizes Schwarzschild to the case 
of rotating black holes, are among the most important examples. However, for situations in which 
1) the gravitational field is strong and highly dynamical, 2) there are no symmetries, and 3) one 
does not have substantial a priori information about the expected nature of the solution, numeri- 
cal analysis provides the only currently viable general approach for solving the Einstein equations. 
This, of course, is precisely the situation that confronts us in our current study of interacting boson 
stars. 

We end this brief overview of general relativity, with the observation that the complex, tensorial 
nature of the Einstein field equations has fostered the development of many different approaches 
to their expression and analysis. We will refer to any such approach as a formalism for general 
relativity, and in this thesis will focus exclusively on formalisms that are appropriate for solving the 
field equations as an initial- value problem. That is, we are interested in approaches where the state 
of the gravitational field — defined in terms of some set of dynamical variables for the field — can 
be prescribed at some initial time. The initial data will then be evolved to the future (or past) 
using partial differential equations of motion for the dynamical variables. A given formalism must 
thus include prescriptions for 1) the decomposition of spacetime into space and time, 2) the choice 
of dynamical and auxiliary variables, and 3) the choice of time and space coordinates. Crucially, 
it must also provide a specific form of the equations of motion that will be suitable for use in 
numerical computations. As mentioned previously, the 3-1-1 (or ADM) formalism has formed the 
basis for most work to date in numerical relativity. It is also the fundamental approach that we 
adopt here, and is the topic of the next section. 



^Here we should emphasize that developments over the past decade or so — both theoretical and computational — 
have made it clear that the traditional form of the 3-1-1 equations is not generally suitable for fully 3D numerical 
calculations. This is due to the fact that this form of the field equations is not strongly hyperbolic, and thus does 
not lead, in general cases, to a well posed initial value problem. However, this fact is not relevant to our work since 
as a result of the approximation we adopt, the quantities describing the gravitational field arc all governed by elliptic 
rather than hyperbolic PDEs. 
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2.2 The 3 + 1 (ADM) Formalism 

As we have just indicated, in the 3+1 (or ADM) formahsm, spacetime is decomposed into space 
and time. E| Fundamental to this approach is the choice of a timeUke unit vector field, in the 
spacetime and a foliation of spacelike hypersurfaces, E*, parametrized by a time function. The 
timelike vector field is chosen such that its integral curves represent the time coordinate (or time 
function), t, throughout the spacetime, i.e. such that t"Vot =1- In essence the vector field 
and the time function are chosen to enable the definition of the notion of dynamical evolution for 
quantities defined on the hypersurfaces. In order to make the last statement precise, it is necessary 
to decompose spacetime vector fields (as well as tensor fields of higher rank) into (vector) pieces 
that are either defined 1) exclusively on any given hypersurface, or 2) in a direction normal to the 
hypersurface. For example, for the vector field t° itself we write 

t" = an"" + , (2.12) 

where n" is the future-directed, timelike unit vector field normal to (thus satisfying gabn'^n^ = ~1 
with our convention for the metric signature), a is the function that gives the component of 
in the normal direction, and /3° is a vector field that resides on the hypersurface (thus satisfying 
9ab'n°'ff' = 0). In 3 + 1 parlance, a and /3" are known as the lapse function and shift vector, 
respectively. 

From gabn^-n^ = -1 and gabn^P^ = it follows from ([^1^ that 

a = -gabt'^n'' . (2.13) 

Introducing the projection tensor, defined by 

7% = 5\ + n°nb, (2.14) 

and also sometimes denoted J." h, it follows from (|2.12|) and (|2.14|) that the shift vector is given by 

r=7V'. (2.15) 

We also note that in order for t°Vat = 1 to be satisfied, (|2.13p implies that we must have 

ria = gabn!' = -aVat. (2.16) 

The decomposition of vectors into "temporal" and "spatial" parts can be readily generalized to 
tensors of arbitrary rank. Following York jl21j . for any vector field, W"" , we define 

W^ = -W''na, (2.17) 

and, in general, any upstairs fi index denotes that the original tensor index has been contracted 
with —na- On the other hand, for a dual vector field, Wa, York defines 

Wf, = +Wan'' , (2.18) 

and then any downstairs h index denotes contraction with 

Of special interest are tensors which have been completely projected onto the hypersurface. For 
a general tensor, T°''' ' cd... we write 

± T'^^-ed... = 7^7'/ ■ • •7^c7^ • ■■T^^ -gh... (2.19) 



*The development in this section closely parallels that of York in |121| . 
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and _L T'^'^ ' cd... is called a spatial tensor since 

ua ± T'^' -cd... = rib ± T'^^ -ed... . . . = n= ± T'^'-'cd... = ri'^ ^ r"''-,d... . . • = . (2.20) 

One particularly important spatial tensor is the induced three dimensional metric, jab, of the 
hypersurface 

lab = gab + naUb , (2-21) 

which, the reader should note, is also given by lowering the upstairs index of the projection tensor, 
l-" b = l\- 

In order to describe parallel transport of spatial tensors and curvature within the hypersurface, 
a covariant derivative operator must be defined. A natural choice is to project the four dimensional 
covariant derivative onto the spacelike hypersurface, leading to the definition of a three dimensional 
covariant derivative operator. Da'. 

Da=±Va. (2.22) 

The Riemann curvature tensor on the hypersurface is defined analogously to its four dimensional 
counterpart: 

2D[aDh]Wc^nab/wd, (2.23) 

while the Ricci tensor and Ricci scalar are obtained by the usual contractions of the Riemann 
curvature tensor: 

nab^nacb' and 7^ = 7^^. (2.24) 

As mentioned previously, the Riemann curvature tensor describes the curvature intrinsic to a 
manifold. In the current case, which involves the embedding of three-dimensional hypersurfaces in 
a four-dimensional spacetime, there is a second type of curvature, known as the extrinsic curvature, 
that quantifies the embedding. Since the orientation of the hypersurface within the spacetime is 
related to the unit normal vector, n", the covariant derivative of thus characterizes nearby 
changes in the orientation. The extrinsic curvature tensor, Kab, can therefore be defined as the 
projection of the covariant derivative of the dual vector field associated to the normal vector field: 

Kab = - -L VaTlb = -i _L £„5a6 = -^AiTab- (2.25) 

Since both sides of the Einstein equations must be decomposed in the 3-1-1 approach, we must 
also consider various projections of the stress-energy tensor along the normal n° and onto the 
hypersurface, Sj. First note that the stress-energy tensor Tab is a type (0,2) symmetric tensor. A 
generic tensor of this type can be decomposed in the following way in the 3-1-1 formalism: 

Tab =-L Tab - 2ri,(a _L T^n + naUbTnn- (2.26) 

We rewrite the above as 

Tab = Sab - '2J{anb) + priaUb, (2.27) 
where the quantities, p, Ja and Sab are defined by 

p = Tfif, = r,6n V = T'^V^f,, (2.28) 

Ja = ±Tan=^{Tabn''), (2.29) 

J" EE _L T°" = - ± (r"''nb), (2.30) 
Sab = ^Tab, (2.31) 



Physically, p is interpreted as the local energy density, J° as the momentum density, and Sab as 
the spatial stress tensor, all measured by observers moving orthogonally to the slices. 
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The several possible combinations of projections of Einstein equations along the "temporal" 
and "spatial" directions give rise to the equations of motion in the 3+1 form. Projecting both 
indices with n° we find 

n + K^- KabK"'' = IGnp, (2.32) 

where K = K"^ a is the trace of the extrinsic curvature. Eq. (|2.32p is also known as Hamiltonian 
constraint. On the other hand, if only one index is contracted along ri", while the other is projected 
onto the hypersurface, we derive a three-vector equation known as the momentum constraint: 

DbK""^ - D^K = SttJ". (2.33) 

We note that care must be exercised in using the covariant form of this equation, since due 
to the relative sign in the definitions of Ja and J" in equations (j2.29p and (|2.30p (see definitions 
((2T7)) and ((2l8l) ). we have 

±Gan = -D''Kah + DaK = SirJa, (2.34) 
_L G"'" = DbK"'' - D^K = %TTr. (2.35) 

Key features of the constraint equations are the presence of only spatial tensors and the absence 
of explicit time derivative of these tensors. They must be satisfied by {"yabi Kab\ on all slices, 
including the initial slice. 

The 3+1 equations that do involve time derivatives of the spatial tensors {'^ab,Kab}, are thus 
called evolution equations. For the spatial metric, an evolution equation follows from the definition 
of the extrinsic curvature (|2.25|) : 

Ctlab = -2aKab + Cp^ab- (2.36) 

where Ct is the Lie derivative with respect to the vector field t"". We note here that the Lie 
derivative of a general tensor, T°-^---°'''bi...bii with respect to a vector field, v°- , is defined by:[2| 

k 

= v'D,T''^-''%,,„b,-Y,T''^-'-'"^b,...b,Dcv''^ 

i=l 

I 

-^Y,T'''-'"'b,...c..MDb,v' ■ (2.37) 

The evolution equation for the extrinsic curvature can be derived by considering the projec- 
tion of both indices of Einstein equations, which involves computation of + Ranah- After some 
manipulation, we find 



CtK^b = CpK^b - D^Dba + a 



n\ + KK\ + 8^ ( ^j\{S -p)- S\ 



(2.38) 



All of the definitions and decompositions discussed thus far are independent of any choice of 
coordinate system. Operationally however, we must introduce coordinates, = {t,x'^), in order 
to cast (|2.32p . (|2.33|) . p.36|) and p.38p as a system of partial differential equations that can then 
be solved — using a numerical approach in general — for given initial data and boundary conditions 
(we remind the reader that Greek indices such as /i range over the spacetime values, 0,1,2,3, while 
Latin indices such as i are restricted to spatial values, 1,2 and 3). We thus adopt such a coordinate 
system, with the time coordinate, t, being identified with the time function. In terms of tensor 



^We should emphasize that this expression is valid for any derivative operator Da- The reader should refer to 
App. C of Wald [120) for a comprehensive discussion. 
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components taken with respect to the coordinate basis, the spacetime displacement can be written 
as 



^ -a^dt^ + -fij {dx^ + P'dt) {dx^ + put) 



(2.39) 



We emphasize that the lapse function, a, the three shift vector components, /3% and the six com- 
ponents of the symmetric 3-metric, ^ij, that appear in the above expression are all functions of 
the coordinates x^^ = We also note that the lapse function can be interpreted as the ratio 

of proper time to coordinate time for an observer travelling normally to the hypersurface, while 
the shift vector encodes the translation of spatial coordinates from one slice to the other, again 
relative to propagation in the normal direction. In addition, component indices of spatial tensors 
are lowered and raised with the 3-nietric jij and its inverse 7*^ , respectively, where 7*-' is defined 
by l'''lkj = S'j. 

As discussed above, the decomposition of the stress-energy tensor gives rises to a variety of 



energy-momentum quantities defined by equations 
and Ufj, = (— a;0), these quantities become 



Using the relations 



00 
2 



a 
a 



2r7i00 



(l/a;-/3Va) 



(2.40) 



ML, 

a 



J* = a(r" + r""/3*), 

Sij = Tij . 

The component form of the evolution equations can then be written as 



(2.41) 

(2.42) 
(2.43) 

(2.44) 



and 



S 



KK' 



8tt 



(2.45) 



We will use these versions of the evolution equations for the components of the 3-metric and extrinsic 
curvature when discussing spherically symmetric spacetimes in Ch. [3] However, for the purpose 
of performing a conformal decomposition of the 3-1-1 evolution equations — which is done later in 
the current chapter — we will start from slightly different forms which may be easily derived from 
previous formulae using straightforward tensor calculus. Specifically, for the 3-metric components 
we can also write 



which follows immediately from (|2.36p as well as 

This last result is easily established from (|2.46|) and the fact that 



(2.46) 



(2.47) 



=^ 7jfc'C(t-^)7'^ = -7'^'C(t-/3)7jfc ■ 
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For the extrinsic curvature components, Kij, we have 




(2.48) 



Additionally, the constraint equations in component form are: 



DjK'^-D'K = 8nJ\ 



(2.49) 
(2.50) 



Even within a specific formulation of the Einstein equations, such as the 3+1 approach described 
above, the coordinate invariancc of general relativity ensures that there are generally many distinct 
possibilities to solve the specific set of PDEs that results once the coordinate system has been fully 
fixed (full specification of the lapse and shift). Here we are referring to the fact that we have 
more equations (4 second-order "elliptic" constraints + 12 first-order- in-time evolution for a total 
of 16 equations) than fundamental dynamical unknowns (6 gij + 6 Kij = 12 unknowns). The 
interested reader is referred to the classic paper by Piran |122| in which nomenclature, such as 
free evolution, constrained evolution and partially-constrained evolution is defined and discussed. 
Here, the key thing to note is that the approximation (CPA) that is adopted in this thesis has 
the advantage of providing a single, well defined set of 5 elliptic PDEs for 5 well defined functions 
that completely fix the spacetime geometry. In this sense, and in an abuse of Piran's original 
classification, we implement a fully constrained evolution for the geometrical field and, further, 
in contrast to the full general-relativistic situation there are no purely gravitational degrees of 
freedom. That means that in the model considered here, as is the case for any model that adopts 
the CPA with maximal slicing condition, all dynamics is linked to the dynamics of the matter. 
This has a host of ramifications, physically, mathematically and computationally, but particularly 
given the efforts that have been expended on taming instabilities in free evolution approaches for 
the full Einstein equations, is one of the most attractive features of Isenberg's proposal. 
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2.3 The Complex Scalar Field 

The matter model adopted in this thesis is a complex Klein- Gordon field, which satisfies a Klein- 
Gordon equation as discussed in detail below. This field represents a simple type of matter that 
when coupled to Einstein gravity, or in the context of the approximation adopted in this thesis, 
admits star-like solutions. Studies focusing on such solutions — known as boson stars — using a 
variety of techniques including numerical analysis, have a rich history and we refer the reader to 
the paper by Schunck and Mielke, [119] (and references therein), for an excellent and thorough 
review of the subject as of about five years ago. 

General relativists have studied Klein-Gordon fields for many purposes over the years. As 
either 1) a classical field or 2) a quantum-theory of spin particles, scalar fields have been widely 
exploited for exploratory theoretical studies. A key point is that the simplicity of scalar matter 
(in terms, e.g. of physical interpretation as well as complexity of the equations of motion), often 
allows one to investigate and understand basic theoretical issues in Einstein gravity relatively free 
of the complications a more realistic matter model could bring in. This is a chief motivation for 
the use of a scalar field in the current work. 

In the discussion below, we will refer to the system of a single Klein-Gordon field minimally 
CO upled0 to Einstein gravity as the Einstein-Klein-Gordon (EKG) system. We note that we adopt 
a complex scalar field, rather than a real one, since it has been long known that there are no regular, 
static solutions (i.e. star-like solutions) for a real scalar field in general relativity. Interestingly, 
for us this turns out to be something of a technical point, since for a real field coupled to Einstein 
gravity there are quasi-static solutions known as "oscillons" which have decay times that can be 
much longer than the intrinsic dynamical time |123j . Thus in principle one could use a real scalar 
field to study some of the effects we wish to examine in this thesis and follow-up work. However, 
for a variety of reasons, not least including the ease with which one can generate star-like solutions, 
we prefer to work with the complex field. 

Additionally, the complex field must interact in an non-trivial potential, which we define to 
include a mass term. The possibilities for potential choice are endless, and have formed the basis 
for much previous work. Again, wc choose the simplest approach and, at least initially, adopt only 
a mass term. The boson stars modelled by scalar fields with this self-interaction potential arc also 
known in the literature as mini-boson stars. 

Since mathematically we are ultimately interested in solving an approximate EKG system as an 
initial value problem, we note in passing that a scalar field is known to admit a well-posed initial 
value formulation in the following sense |120j : 

• For an initial data in a spacelike Cauchy surface E in a globally hyperbolic spacetime 
{Ai,gab), there is an open neighbourhood O of S such that the Klein-Gordon equation has a 
solution in O and {0,gab) is globally hyperbolic. 

• The solution in O is unique and propagates causally. 

• The solution depends continuously on the initial data. 

In addition to possessing star-like solutions, a key advantage of scalar matter relative to the 
more-astrophysically relevant perfect fiuid case, is that the solutions do not tend to develop shocks 
or rarefaction regions. Rather, as is expected from the structure of the equations, and has been 
born out by many [Till [ESI [ml Hm [IIHl [ESI Il26l [ml [Ell H El El [EH [HTl [El [EQ] previous 
numerical studies, solutions tend to remain as smooth as the initial data, except at actual physical 
singularities (produced, for example, by gravitational collapse). 

^"The notion of minimal coupling is defined in the next section. 

^^When the domain of dependence of a Cauchy surface is the whole spacetime (region of interest) then this 
spacetime (region) is said to be globally hyperbolic 
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2.3.1 Einstein-Klein-Gordon System 

One route to study matter models in general relativity is to postulate equations of motion for the 
matter, derive a suitable stress-energy tensor Tab compatible with those equations and then use 
the Einstein equations to relate the matter distribution to the spacetime curvature through this 
stress-energy tensor. However, and as already mentioned, in many cases local conservation of the 
stress-tensor, implies the matter equation of motion. In such cases it is essentially sufficient to 
postulate the stress-energy tensor Tab for the matter model in order to study the coupled system 
of matter distribution and spacetime geometry. 

Additionally, and for a variety of reasons, it is often useful to adopt a Lagrangian (or variational) 
approach to Einstein equations and we will do so here. Here, a basic observation is that the vacuum 
Einstein equations can be obtained from the functional derivative of the so called Hilbert action 
functional; 

SGig""'] = I Cg^ f V~9R. (2.51) 

JM JM 

where Cg = y/^gR is the Einstein Lagrangian density and R the Ricci scalar. It is a standard 
exercise to show that the functional derivative of the action with respect to the inverse metric 
is 

^ = V-gGab, (2.52) 



Sg" 

which then clearly yields the vacuum Einstein equations Gab = when the field configuration 
satisfies the action cxtremization condition: 

P = 0. (2.3) 

In order to obtain a coupled matter-gravity system, one then simply adds to the matter La- 
grangian density to the Hilbert term (this is the so-called minimal coupling prescription). We thus 
have 

C^Cg + anCM, (2.54) 

where aM is a coupling constant that can typically be rescaled through a redefinition of the matter 
fields. In the case of the Einstcin-Kleiu-Gordon system one conventional choice that we adopt here 
is aKG = IStt. 

The stress-energy tensor can now be calculated as the variation of the matter action with respect 
to the inverse metric field Specifically, one has 



rr Q:j\/ 1 SSm 



where Sm is the action functional for the matter field M (understood here as a generic collection 
of matter fields and their higher order covariant derivatives): 

Suig^^M] = / Cm- (2.56) 

JM 

Finally, variations of the action Sm with respect to the matter fields themselves generate the 
equations of motion for the matter. 

For the reasons discussed above, we now restrict attention to matter consisting of a single 
complex scalar field, $. We write the field as 

$ = i02 = (1)0 exp(i6'), (2.57) 



where (f>i, 02, (^o a-nd are real- valued functions of the spacetime coordinates x^. The Lagrangian 
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density associated with this field is 

1 



= --\/^(.g"''Va$Vb$* + C/(|$n), (2.58) 

where iJd'&p) is the scalar field self-interaction potential. As also discussed above, we will eventu- 
ally specialize to the case where U contains only a mass term 

f/(|$P) = to2^$* m^(l)l = m^{(l)l -f <j)l), (2.59) 

but for the time being we will continue the discussion in terms of general potentials. 

We now rewrite the Lagrangian p.58p in terms of the real- valued quantities defined in ()2.57|) : 

= -^V^(5"''V„0iVb0i+g"^Va02Vfc02 + f/(0o))- (2-60) 

Klein-Gordon equations for each real valued component {(pA S {4>ii4'2\) can then be obtained by 
the usual variational procedure, yielding 

A2\ 



"0 



^<I'a-—^<Pa^Q or g'^'-V^Vb^A - — = 0, A = 1,2, (2.61) 



where □ = g°'^Va^b is the general relativistic D'Alambertian operator. 

Once a coordinate system is chosen, each of the above scalar Klein-Gordon equations is a 
second-order-in-timc PDE. In keeping with the 3-1-1 spirit, it is often conventional to recast these 
equations in first-order-in-time form, and we do so here. One specific way of doing this is to pass 
to the Hamiltonian description of the system in the standard fashion. Namely, we consider the 
Lagrangian as a function of the field and its spatial and time derivatives; we define a conjugate mo- 
mentum associated with the field; we write down the Hamiltonian functional from the Lagrangian 
by performing a Legendre transformation for the conjugate momentum; and we then evaluate the 
Hamilton evolution equations from the Lagrangian. Full details of this procedure can be found in 
standard texts such as Wald |120j . and here we simply summarize the results for the scalar field. 

Since the scalar-field Lagrangian p.SSp does not contain time derivatives higher than first order, 
the conjugate momentum associated with each component of the scalar field can be defined as: 

Ha ^ ^^^3L,.\ (2.62) 



or, more explicitly 



(t)A - P'di(l)A 



(2.63) 



where the overdot denotes differentiation with respect to the time coordinate. 

The dynamical equations of motion (|2.6ip can be rewritten in terms of these conjugate fields, 
leading to four first-order-in-time partial differential equations for the two conjugate pairs of vari- 
ables {^AjHa} (where A = 1,2): 

dt(l)A - -^IlA+l3'd^(j>A, (2.64) 



dtUA = d,iP'TlA) + d,iV^Y^d,cf,A)-V^^^^^^<f>A. (2.65) 



d<l>l 



These last equations can be further manipulated using the following relationship between the 
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determinants of the spacetiinc and spatial metrics: 
yielding 



a, 



a 



(2.66) 

(2.67) 
(2.68) 



Having obtained equations of motion for the scalar field, we now consider computation of the 
stress-energy tensor and the 3+1 quantities derived from it. Using the variational prescription 
sketched above we find 



Tat = ^ [Va$V6$* + Vb$Va$* - ffab (ff"" Ve$ Vrf$* + C/($$*))] , 



(2.69) 



which, adopting a coordinate basis, and working with the real-valued field components becomes 

Tap = X! 9 [(^a(t)Adp(l)A + dpCpAdaC^A " Gapg^" d^cj)Adt,(l>A] ~ -^9 a {(^1) . (2.70) 
A=l " 

From the above, and using ()2.40p - ()2.43p . we compute the 3+1 stress-energy quantities and find: 

rm _ ^ 1 1 

9 2^ 



p = 



A=l 



^+Yld,(bAdjCj)A 

7 



r 



E 

A=l 
2 

E 

A=l 



— O^ipA 

a/7 



Ha 

"V7 



f^dj(f>A 



1 ^ r 

- I 2d^(j)Adj(i)A + lij 

^ A=\ ^ 



7 



"f^''dp(t)Adq(j)A 



(2.71) 
(2.72) 
(2.73) 
(2.74) 



Additionally, we need to compute the trace of the spatial stress tensor, S = 5*% as well as the 
combination p + S. These are given by 



1 



E 



A=l 



^^-l''d,4>Ad,4'A 

. 7 



P + S = 



A=l 



2^ 

. 7 



- t/(^g). 



(2.75) 
(2.76) 



2.3.2 Noether Charge 

The invariance of the Klein-Gordon Lagrangian density, Eq. p.58p . under a global U{1) symmetry 
transformation $ $e*^ gives rise to a conserved current density according to Noether's theorem. 



^^This relation is derived from the definition of inverse metric: g"" = —a ^ = (—l)'^^^det{'yij)/det(gij.^) = 7/g. 
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Roughly, this rcsuh can be obtained as foUows: 

First, consider the Klein-Gordon action as a functional of the inverse metric, the scalar field 
and its first covariant derivative, instead of the inverse metric and the scalar field alone, as in the 
last subsection: 

(2.77) 



J M 



Also note that for a scalar field we have Va$ = 9a Defining the variation of a functional or 
function with respect to a parameter e as: 



and 



!=0 



(5$ EE — 

de 



e=0 



the variation of the Klein- Gordon action functional can then be expanded as: 



5S 



KG 



dS 



KG 



M 



r,ah 



5^ 



(5$ 



5S 



KG 



(2.78) 



(2.79) 



Our interest here is in variations that keep the action functional constant; that is, variations such 
that 5Skg = 0. The inverse metric g"'^ is invariant under the action of a U{1) transformation: 
Sg""^ = ^|^|e=o = 0, and the first term in the Eq. (|2.79p drops out. Further simplification of 
Eq. (|2.79|) results from noting that the variation with respect to the field derivative can be rewritten 
as: 



de 



.1 ( '^^ 



dam, 



(2.80) 



since ordinary derivatives commute. Inserting the above relationship in Eq. (|2.79p . we have after 
some simple algebraic manipulation: 



SSkg = 



M 



SSkg 
(5$ 



-da 



6S 



KG 



Sida-f) 



(5$ + da 



SS 



KG 



(5$ 



(2.81) 



The first term of the equation above is simply the Klein-Gordon equation of motion which vanishes 
identically. The second term is a total divergence that can be converted to a surface term using 
Stokes theorem, and which also has to vanish if the action is supposed to be invariant under the field 
variation 5$. This then implies that the current density, j", associated with the U{1) symmetry 
and defined by 

ja ^ 1^S<P, (2.82) 



Sida'f) 



is conserved 

A conserved (Nocther) charge, Qn- 
density: 



Saj" = 0. (2.83) 
is associated with the "time" component of the current 



Qn — 



J 



(2.84) 



where is a spacelike hypersurface as previously, and a fixed volume element on St, e, is under- 
stood in the integration. 

To compute the explicit form of the Nocther current we apply Eq. (|2.82p to the Klein-Gordon 
Lagrangian (|2.58l) . obtaining 



KG 



da 



SS 



KG 



S{da^] 



S<P 



SS 



KG 



S{da^*) 



5$' 



da 



5g°''(9h$*5$ + 9f,$'5$*) 



(2.85) 
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so the current density is 



(2.86) 



For an infinitesimal U{1) transformation, $ — > $ + ie$, wc have 

(5$ = ie$ and S^* = -ie$*, 

and we have 

where the constant e has been factored out. Using the component form of the field, $ = 0i + i02, 
this can also be expressed as: 



(2.87) 



(2.88) 



(2.89) 



Finally the time component of the current density in a 3+1 coordinate basis assumes the following 
form: 

= [5**(029t?!)i - ?!)i9t02) +ff**(025.i0i - 0i(9,(/)2)] 



2dt(f)l - 0l9t02) + 



13' 



2di4>l - 0l9j02) 



6in2 



(2.90) 



where Eq. (|2.64p was used to simplify the second line of the above, and to express dt<j>A in terms 
of their respective conjugate momenta 11^. Choosing the fixed volume element e on to be the 
coordinate volume element cfix, the Noether charge can be written as: 



Qn= f icj)lIl2-cj)2lli)d^X 



(2.91) 



and can be expected to be conserved: i.e. to have the same value on each slice St of the spacetime 
foliation. This expression is used in Chap. [5] as one diagnostic to ensure that the numerical code 
used to solve our model system is producing sensible results. 
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2.4 The Conformally Flat Approximation (CFA) 

This section provides a detailed description of the conformally flat approximation (CFA) of general 
relativity. Our definition of the CFA includes a particular choice of time coordinate, known as 
maximal slicing, which is briefly discussed in Sec. 12.4.11 We continue in Sec. 12.4.21 with a detailed 
review of a specific conformal decomposition of the 3 + 1 Einstein equations. Once the conformal 
Einstein equations have been derived, we introduce the assumption of conformal flatness in 12.4.31 
and derive the simplified set of field equations that result. These equations are covariant with 
respect to a choice of spatial coordinates, and in Sec. l2.4.4l we fix those coordinates to be Cartesian. 
This then yields the actual PDEs that we solve numerically. Sec. 12.4.51 discusses three different 
approaches we investigated for imposing boundary conditions on the PDEs. Finally, Sec. 12.4.61 
describes the definition and calculation of the ADM mass, which we use as a diagnostic quantity 
in our computations. 

2.4.1 Maximal Slicing 

In the initial value, or Cauchy, formulation formulation of Einstein's equations, the choice of time 
coordinate is related to the choice of the spacelike hypersurfaces that foliate spacetime since the 
hypersurfaces are level surfaces of the time coordinate. As briefly discussed in Sec. 12.21 the em- 
bedding of these three dimensional hypersurfaces in the four dimensional spacetime is described 
by the extrinsic curvature Kab- It is therefore natural to choose the time coordinate by imposing 
a condition on the extrinsic curvature. One particular choice widely used in numerical relativity 
demands that trace of the extrinsic curvature tensor vanish: 

K = K^a = and dtK = 0. (2.92) 

This choice is called maximal slicing and it is a particularly useful slicing for numerical compu- 
tation since it tends to avoid spacetime singularities — such as those that arise from gravitational 
collapse of matter to a black hole — by "freezing" the evolution in regions close to locations where 
such singularities are developing [131]. Considering a congruence of worldlines for a family of ob- 
servers travelling normally to the hypersurfaces, the maximal slicing condition implies that the 
expansion of the congruence vanishes, inhibiting focusing of the worldlines as well as the formation 
of caustics. Moreover, as the name suggests, and due to the non-Euclidean signature of spacetime, 
K = Q slices have maximal volume with respect to small, but arbitrary, deformations of the hyper- 
surfaces. The interested reader is directed to reference |132| for a more detailed discussion of this 
choice of time slicing and its properties. 

2.4.2 Conformal Decomposition of the Einstein Equations 

Conformal decompositions of the Einstein equations in the context of the 3 + 1 formalism were first 
introduced by Lichnerowicz in 1944 [33], who proposed a specific conformal decomposition of the 
Hamiltonian constraint. The goal of the decomposition was to write the constraint as a manifestly 
elliptic partial differential equation for a specific part of the 3-metric — namely an overall scale, 
or conformal factor — and then to establish existence and uniqueness of solutions of the PDE. In 
the 1970's and early 1980's, York and his collaborators (most notably O Murchadha) |1331 11341 
11351 11361 11371 11381 11391 1140j . made significant and highly influential advances of Lichnerowicz's 
work through a general program aimed at understanding which of the basic dynamical variables in 
the 3+1 approach should be freely specified at the initial time, and which should be fixed by the 
constraints. An analogy was made to electromagnetism, where the field can be decomposed into 
longitudinal and transverse parts, with the former representing the gauge degree of freedom, and 
the latter encoding the dynamical (radiative) content of the theory. Building on previous work 
by Deser and others [1411 1142] , York [136] thus considered similar decompositions for the case of 
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the symmetric rank-2 tensor fields that appear in the 3 + 1 approach: namely, the spatial metric 
and the extrinsic curvature. The main idea was that a covariant[^ decomposition of these tensors 
could yield at least a formal (or perhaps approximate) solution to the constraint equations, as well 
as providing a route to establishing existence and uniqueness of the solutions. In addition, it was 
hoped that the process would lead to an identification of the "true" dynamical degrees of freedom 
of the gravitational field within the 3 + 1 framework. Ideally, these degrees of freedom were to be 
covariant and freely specifiable, and were to encapsulate the radiative content of general relativity. 
York's work resulted in a specific conformal decomposition in which the transverse-traceless part 
of the dynamical variables, {jab, Kab}, were to represent the radiative degrees of freedom. 

Now, there is a strong argument to be made that the goals of York's effort were not entirely 
successful, especially in terms of identifying radiative degrees of freedom on a single slice in the 
strong-field regime. However there can be no doubt that his program was an absolutely crucial 
development for numerical relativists since it allowed the constraint equations to be written as a 
set of coupled, nonlinear elliptic PDEs, for which existence and uniqueness could be established. 
Moreover the equations could be tackled and solved using standard numerical approaches for elliptic 
systems. Most importantly for this thesis, the conformal transverse-traceless decomposition lies at 
the heart of the approximation to the Einstein equations that we adopt for our model. 

Thus, reemphasizing that the physical interpretation of York's approach is still a matter of 
debate, we now proceed to work through the details of the decomposition. We start with 
the general case in which no assumptions are made about the 3- metric, jat, and then specialize 
in Sec. l2.4.3l to the case of a conformally flat jab- Our discussion parallels lecture notes on the 3 + 1 
approach due to Gourgoulhon |144| and, as mentioned previously, we include the development here 
largely for the sake of completeness. 

The first step towards the conformal decomposition of the Einstein equations in 3 + 1 form is 
an investigation of how each of the fields that appear in the equations changes under a conformal 
transformation. Specifically, we must consider the action of conformal scalings on the components 
of 1) the 3-metric, jij, 2) the connection, C^jk, associated with the spatial covariant derivative Di, 
3) the 3-Ricci tensor, TZij, and, finally, 4) the extrinsic curvature Kij. Each of these is discussed in 
turn below, after which results are assembled and used in the decomposition of the field equations 
themselves. 

For the case of the induced spatial metric, jij , conformal decomposition means that we introduce 
a base metric, jij, and a strictly positive function, ip = '^(x'^), known as the conformal factor^ and 
then write 

7„- = . (2.93) 

We note that 7^ is frequently known as the conformal metric and emphasize that it has no direct 
physical interpretation. Defining the inverse of the conformal metric, 7^^^, in the usual way, so that 
lijl''^ = 5^i, and noting that jijj^^ = we have 

yj^^-4^»i. (2.94) 

Proceeding to the connection, we first recall that any covariant derivative, Di, can be defined 
in terms of the ordinary derivative operator, di via 

Av^' =a,^;■'■+P,fc^;^ (2.95) 

where are the components of a spatial vector and ik are the Christoffel symbols associated 
with Di. If the covariant derivative is chosen to be compatible with the spatial metric, as will be 
done here, then we have Dijjk = and the Christoffel symbols can be calculated using the usual 

^^Here "covariant" means covariant with respect to coordinate transformations in the hypersurfaces, i.e. with 
respect to spatial diflfeomorphisms. 

^*The reader who is interested in more details concerning transverse-traceless decompositions and their relationship 
to gravitational radiation in general is directed to the review paper by Thorne |143| . 
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formula: 

T'^k = {d.au + ^kl^l - 9^7^fc) ■ (2.96) 

Similarly, choosing a covariant derivative Di compatible with the conformal metric, so that -Di7jfc = 
0, we have: 

f^fc = [dau + dkia - da,k) ■ (2.97) 
The two covariant derivatives Di and Di can be related through the connection tensor, ik'. 

D,v^ = D,vi + Chkv\ (2.98) 
where ik is calculated in terms of conformal covariant derivatives as follows: 

Chk = ^7'' {Daki + Dkia - Da^k) ■ (2.99) 

This last expression can be rewritten in terms of the conformal metric using equations (j2.93p and (|2.94p 
along with the fact that Di^fjk = 0. After some manipulation, we find: 

C^,k = 27^' [a (hi ^) %i + bk (In ^) - Di (In ^) 7,fe] . (2.100) 

The contraction of the above equation on its first and second indices, or, due to the symmetry 
ik = & ki, on its first and third indices, can be used to provide a useful expression for the 
divergence of a vector in terms of a conformal divergence of an appropriate conformal scaling of 
the vector: 

= b,v' + = + 6A (hi V) = (^^') . (2.101) 

We next reexpress the spatial Ricci tensor, TZik, and Ricci scalar, TZ, in terms of their conformal 
counterparts, TZik, and TZ, as well as additional terms involving conformal derivatives of ip. To that 
end we start from the definition of the spatial Riemann tensor and then rewrite it in terms of the 
connection tensor C^ik- From the definition (|2.23p we have 

TZ^jk^wi::=2D[,Dj]Wk. (2.102) 

In addition, the covariant derivative of a tensor of type (p, q) is given by 

Ar-'^,,...,, = Ar-'",,...,, + E c^'^.r-^-^'',,...,, - E c%,X'-'^n...s...,, . (2.103) 

(=1 (=1 

Using p.l03p in p.l02p . and performing some algebraic simplifications we find: 

TZtjk'' ~ TZijk — '^D[iC''j]k + 'i.C'"' k[iC^ j]m , (2.104) 

The Ricci tensor is now given by contracting this last equation on its second and fourth indices, 
yielding 

TZik = TZtik'' = TZtk — '2D[iC''i]k + 2C""fc[.jC';],„ . (2.105) 

Using (|2.100p in (|2.105[ we have the desired relationship between the physical and conformal spatial 
Ricci tensors. 

TZ^k = 7^,;fe-2AA(hlV)-2W"A^n^(lnV') 

+ 4A (InV-) A (In V) - 47,fc7'™A (In V) A„ (InV') • (2.106) 
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Taking the trace of this equation then provides an expression relating the physical and conformal 
Ricci scalars: 



7^ = 7*'=7^a. - 



n - 8f''D,Dk (In V) - Sf'^D, (In V) Dk (In^A) 

= i;-^n~8^p-''f''D,Dk^, (2.107) 

where TZ = j'^'^TZik- One sees immediately from this last equation one of the crucial results of 
Lichnerowicz's approach: namely that the conformal transformation "pulls out" from the extremely 
complicated expression for TZ (when written out in full for the case of a generic 3-metric) a term 
that is proportional to the covariant Laplacian of the conformal factor, j'^'^DiDkip 

Before moving on to a conformal treatment of the Einstein equations themselves, we need to 
perform a conformal decomposition of the extrinsic curvature tensor, Kij. In anticipation of our 
use of the decomposition in conjunction with maximal slicing, K = K^i = 0, it is convenient to 
first write the tensor as a sum of traceless and traceful pieces: 



K,, = A,j + -Kj,j, (2.108) 



K'^ = A'^ + \kj'', (2.109) 

where Aij and A^^ are, by definition, the traceless parts of Kij and ivT*-', respectively, so that 
= jijA''^ = 0. In the following it will eventually be — i.e. the traceless part of K^^ — 
that we will conformally transform according to 

A'^ = iP'A'^ . (2.110) 

The non-zero conformal power, s, appearing in this definition can be chosen so that convenient 
mathematical relationships result. In this regard it turns out to be natural to start from the 
evolution equations (|2.46l) for the spatial metric components. We recall that these are given by 

Cm7ij = ^(t-i3)li] = ~2aKij , (2-111) 
where m'' = r - = an°. Using ()2.108p and (|2J3| in the above, we find 

2 

Cmlij = ~2a'ip~'^Aij - -aKjij - Aj^jC^n (In?/') ■ (2.112) 

Note that this equation involves time derivatives (through the Lie derivatives along m°) for both 
the conformal metric and the conformal factor itself, and in this sense is a "coupled" evolution 
equation for jij and ip. Since we wish to treat these quantities as dynamically independent, we 
must perform some manipulation to get decoupled evolution equations. We start by taking the 
trace of (|2.112|) by contracting both sides with 'jij, and making use of j"^^ Aij — Aij = 0: 

f'Cm7^J = -2aK - 12A„ {In^j) . (2.113) 

We now use the well known formula 

J(lndct A/) EE tr (AJ-^ ■ SM) (2.114) 

where M is an invertible matrix, d represents an arbitrary derivative operator, tr denotes the trace 
operation and • is matrix multiplication, to rewrite the left hand side of (|2.113p as 

f^CmJi] = Crn (lndet7y) = Cm (ln7) , (2.115) 
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where 7 = det(7ij). Now, in the approach to the conformal decomposition of the Einstein equations 
that we are following, a key demand is that the determinant of the conformal spatial metric be Lie 
dragged from hypersurface to hypcrsurface along the vector field i": 

£t7 = 0. (2.116) 

Using (|2.116p in (|2.115p we have 

f'C^%^-Cp{\n-i). (2.117) 
Moreover, again using (j2.114p we can write 

-/:^(ln7) = -f^£^%-, (2.118) 

so that (|2.117|1 becomes 

f'C„,% = -f'Cp% = -2D,f3\ (2.119) 

where we have used the definition (|2.37p in the last equality above. We can then use this last result 
to eliminate the term "f^-' Cmjij that appears in Eq. (|2.113|) . Performing the substitution and some 
trivial manipulation and rearrangement, we have (recalling that m° ~ t"" — /3") 

£,„ In V- = {dt - Cp)\n^: = i (a/3' - aK^ ■ (2.120) 

This then provides the desired (decoupled) evolution equation for the conformal factor, ?/;. 

Additionally, (|2.120p can now be used in (|2.112p to yield a set of evolution equations for the 
conformal metric components: 

2 

{dt - Cp) 7ij = -~2ai~'^Aij - -aK^^j - Aj^jCm (In?/') 

= -2aly - ^7„- A/3^ (2.121) 

where we have (suggestively) defined Aij = tl)~'^Aij. Note that Aij is traceless 

fiA.j = ^S''^-^A,, = , (2.122) 

and that the contravariant components, A*-' , are given by 

A'i = f'^7J'ife,A*^' = ir^A'\ (2.123) 

Thus, the conformal decomposition of the evolution equation for the spatial metric suggests that 
we choose s = — 4 for the conformal exponent in 

A'^ = iP'A'i . (2.124) 

It is worth emphasizing, however, that the choice s = —4 is not unique in terms of leading to 
simplifications (or "naturalness") in the conformal equations. Another common scaling that was 
first adopted by Lichnerowicz |33] is s = —10. This choice also arises naturally from a decomposition 
of one of the Einstein equations, but this time when the decomposition is applied to the momentum 
constraint equations. Since we do not use this specific decomposition in the current work, the reader 
who is interested in further details concerning it is directed to Gourgoulhon's lectures |144| . 

We now turn to the task of deriving conformal evolution equations for Aij and K. This will be 
done in two stages. We will first use the 3 + 1 evolution equations for Kij, as given by (|2.48p . to 
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derive evolution equations for the trace-free components, Aij, and the trace, K. These equations 
will be expressed in terms of physical (i.e. non conformally scaled) quantities, but will then be 
used in the second stage to derive update equations for Aij and K that are written in terms of the 
conformally scaled variables. 

We thus start from (|2.48p , which we recall reads 



-DiDja + a 



Now from definition ()2.108p of the decomposition of Kij, we have Kij ~ Aij + ^K^ij, so 

1.1. 1 2 

where we have used (j2.11ip in the last step. Furthermore, we have 

where we have used (|2.47p 

In addition, taking the trace of p.l25p we find 

j'^CmKij = -f'DiDjU + a[n + K^ - 2KijK'^ + 47r (S* - 3p)] . 
Substitution of this last result in p.l27p gives 

CraK = -f^ DiDja + a [7^ + + 47r (5 - 3p)] . 



(2.125) 

(2.126) 

(2.127) 
(2.128) 

(2.129) 
(2.130) 



Now the 3-Ricci scalar TZ is an extremely complicated function of ^ij and its first and second 
derivatives. It is thus computationally advantageous to use the Hamiltonian constraint p.49p in 
the form 

7^ = -if^ + KijK'^ + 167rp (2.131) 
to eliminate TZ from (|2.130p . The yields the evolution equation for K in its final form: 

CraK = {dt -Cp)K = ~fW,Dja + a [K,,K'i + in (S + p)] . (2.132) 

We now continue by deriving the evolution equation for the the trace free components, Aij, of 
the physical extrinsic curvature tensor. We first solve (|2.126p for CraAij- 



1 . 2 

Cra Aij — CraKij "^^ij Cja-^ ~^^KKij . 

Using (|2.125p and (|2.132p to replace the terms CmKij and CmK, respectively, we find 



(2.133) 



r 4 

^^ra-^ij 



-DiDja 



7^^ 



-KK, 



2K,kK^, 



87r S, 



-7,, [-i'*'DiDua-a{n + K^)] . 



(2.134) 
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Using the decompositions 



A. 



(2.135) 
(2.136) 



in the above, we find, after some manipulation 



r A- 



1 



^TT I Sij - gTij'S' 



{j"'DiDka - an) 



(2.137) 



This is the final form of the evolution equation for Aij, which together with the evolution equa- 
tion (|2.132p provides an equivalent system to the original 3 + 1 equation (|2.125p for Kij . 

We now proceed to the second stage of the current calculation which involves reexpressing p.l37p 
and (|2.132p in terms of the conformally rescaled variables. We begin by considering the left hand 
side of (|2.137p . which can be rewritten as 



r A 



I] ■ 



Using (|2.120l) to eliminate the term in this last expression we have 



^mA-ij '0 



r A - - 



~Aj 



aK 



(2.138) 



(2.139) 



Replacing the left hand side of (|2.137p with the right hand side of the above equation, and solving 



for CmAij, we find 



r 4 



7^^ 



-7y {-/"'DiDka ^an)}~ -A,, ibiP' - aK 



-KA,, - 2A,kA''^ 



2 7 



(2.140) 



Proceeding, we must also rewrite the terms involving the physical covariant derivatives of the 
lapse function, a, using the derivative operator, D^, which is compatible with the conformal metric 
7ij. Specifically, we have 

D,D,a = b,{Dja)-C%{Dka) 
= DiDja - C%Dka 



= D.Dja - 2Dka 



b, (In ^) + bj (In ij) 5\ - %-7"'^i (In i') 

= b.bja - 2bi (In V') bja - 2b j (hn/-) Aa + 2^,,j^^^bi (In i/-) bka , (2.141) 

where the definitions (|2.103|) and (|2.100p were used in the first and third lines, respectively. We 
also need the trace of this equation, which is given by 



fW^Dja = ijj-^fW.Dja = V"'' f^b.b^a + 27'^' A (InV') b^a 



(2.142) 



We have now expressed all of the terms that appear in the evolution equation (|2.140p for At 
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in terms of conformally scaled quantities. Assembling results, we find after a considerable amount 
of algebraic simplification that 



r 



-D,D,a + 4^(, (In A) a + i^^'^' (aA^ - 4Dk {\n^) Dia 
T^rj - - 2^,: A (liWO + (In^A) (InV') 



%-7*'-' [DkDi (In V) - 2Dk (In^) A (In V) 



+ a 



KA,j - 2^^'A,kAi, - 8^ ( - -7,, 5 



1 



2 - 
3 



(2.143) 



Here we have used equations (|2.106p . (|2.107p . (|2.14ip and (|2.142p as weh as = i^^Aij. 

We can now also write down the evolution equation for K in terms of the conformally scaled 
variables. To do so we first note that from the decompositions ()2.108p and 12.109]) . and using the 



fact that Aij is traceless, we have 



A 



A'^ + -K^'' 
o 



A,,.A'^ 



Using this last result as well as ()2.142p in ()2.132p . we find 

{dt -Cp)K^ ^^-^f^ \bkbia + 2bk (Ini^) bia 



1 



+ 47r (5 + p) 



(2.144) 



(2.145) 



Eqs. p.l43p and (|2.145p are our desired evolution equations for the extrinsic curvature quantities 
within the conformal framework. 

Having dealt with the evolution equations for the spatial metric and extrinsic curvature, we are 
left with the conformal treatment of the constraint equations. 

First, using (|2.107p and (|2.144l) in the Hamiltonian constraint (|2.49p . we readily find 



f'bkbii^^\i,n 



1 T 



1 



^AkiA'^' - —K' + 27rpj ^.^^ = . 
Second, in order to rewrite the momentum constraint, we first calculate 

3 

= bjA'i + C)kA^'' + C'jk^'^ + l^j-'^fWjK 

o 

= bjA'^ + Abj (In V') A'^ + 6b j (In V') A'^ + -i^-^^b'K 



(2.146) 



bjA'^ + ^-^°bj {iP^^A'^) + -ip-'^b'K. 

3 



(2.147) 



In the above derivation we have used the following: (|2.108p in writing down the first line; p.lOSp . 
as well as the fact that the actions of Di and bi on scalar functions such as K are identical, 
in going from the first to the second line; expression (|2.100p for the connection, and the fact 
that Aij is traceless in going from the second to the third line. Using (|2.147p in the momentum 
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constraint (|2.50p then yields 



and using A^^ = -0 ^A*^ we have 



2^. 



DjA'^ + QDj (In?/') - -I^'ii' = %TTi^'^J\ 



(2.148) 



(2.149) 



We have now completed the task of confornially decomposing the Einstein field equations within 
the 3 + 1 formalism, and it is thus appropriate to collect the results. 

We have the following set of evolution equations for the conformal factor, ij}, the conformal 
metric 7^ , the trace of the extrinsic curvature, if, and the conformally scaled, trace-free part of 
the extrinsic curvature, Aij: 



aK] , 



-2a A 



{dt - Cp) K = -^-^f^ DkDia + 2Dk (Ini^) Dia 



AuA'^' + -K^ + {S + p) 



(2.150) 
(2.151) 
(2.152) 



~b,bja + Ab(, (In?/;) D^^a + ^l^jl^^ {bkDia - 4^fc (In^) Dia^ 



^^^3 - fl^jT^ - 2Ai^j (In?/') + 4 A (In?/') (In?/') 



2 

+ 3 
+ a 



%-7^-' {bkbi (ln?/0 - 2bk (In?^) A (In?/') 



^ Ay- A/3'. 



(2.153) 



We also have the Hamiltonian and momentum constraints: 



7'^'' - ^^7^ + ( ^AkiA"' - -^K' + 2^p ) = 0, 



1 



1 7 



1 



(2.154) 



bjA^i + 6 A (In?/') i'^' --b'K^ 8TTi''^J\ 



(2.155) 



The above system of 3 4- 1 conformal Einstein equations is to be solved for the conformal 
unknowns ?/', "fij, K and Aij. Once the conformal variables have been computed, the physi- 
cal quantities — namely the spatial metric 7^ and the extrinsic curvature Kij — can be recovered 
from (mil), (Pl^ and A^ = i^^Aif 

lij 



= ?A^ ( A, J + 



(2.156) 
(2.157) 
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2.4.3 The CFA Equations 

We are now in a position to derive the equations that define the approximation of general relativity 
that we adopt in this thesis. As discussed previously (Sec. l2.4."2|) . a key element of the approximation 
is the demand that the 3-metric, 7ij, be conformally flat, so that 7^- is the constant, flat 3-metric, 
which we denote ^ij. This motivates the nomenclature "Conformally Flat Approximation" (CFA) 
that we have adopted. However, we emphasize that wc combine the demand of conformal flatness 
with the particular time coordinatization given by the maximal slicing condition, which requires 
that _ftr = on each hypersurface. Together, conformal flatness and maximal slicing result in a 
significant simplification of the system of conformal Einstein equations that was displayed at the 
end of the previous section. First, denoting by Di the covariant derivative compatible with the flat 
3-metric 7^^, and using K = 0, the evolution equation (|2.150p for the conformal factor -0 reduces 
to 

^ " (2.158) 



Next, the evolution equation (|2.15ip for 7^, becomes 



{dt - C[3)^ij 



(2.159) 



Continuing, since K must identically vanish on all slices, we have {dt — ^p)K ~ 0. Thus 
equation (|2.152[) becomes 







DkDia + 2Dk (In^b) Dia 



(2.160) 



which is now interpreted as an elliptic equation that the lapse function, a, must satisfy on each 
slice in order that the hypersurfaces remain maximal. 

Proceeding to equation (|2.153p . we note that since the conformal metric is flat, the associated 
Ricci tensor, TZij = TZij and Ricci scalar, TZ = it, both vanish. Using this fact, along with K = 0, 
the evolution equation for Aij becomes 



b,bja + Ab^, (ln0) bj)a + -7^7'^' {bubia - Abk (Ini^) bia 



-2D,bj (In V-) + AD, (In?/-) Dj (In?/-) 
P^jt" (Dkbi (InV^) - 2bk (In 7/.) A (In^) 



2f'A,uAi, - 8^ ( ^-^S,j - ^7,,5 



\A,jbi(i' . 



(2.161) 



Turning to the Hamiltonian constraint (j2.154p . and again using TZ — and A' = 0, we flnd 



1 1 



TDkDi^ + ( -AkiA^' + 2np ]ifi = Q 



(2.162) 



Finally the momentum constraint (|2.155p becomes 



DjA'^ + 6Dj (InV') i'J = Siri'^r , 



(2.163) 



where once again K = has been used. 

Eq. (j2.159p merits further attention. Since the conformal 3-metric is constant from slice to slice 
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wc have 9t7ij = 0. Additionally, Eq. (|2.37p tells us that we can write 



(2.164) 



where we have used Dk^ij = in the last step. Using this last equation along with dt^ij — 
in p.l59p we find 



1 

2^ 



We can also raise both indices of (|2.165p — using the inverse flat metric, -y*-' — to get 



(2.165) 



(2.166) 



We thus observe that having made the requirement of conformal flatness, we can view the compo- 
nents of A*^ as being derived from the shift vector components, (3^ . Now our general inventory of 
geometric variables within the context of the conformal 3 + 1 approach included the 4 kinematical 
quantities a and (3^, and the 12 dynamical fields ■0, 7ij, K and Aij. We now see that of these, 
once conformal flatness has been imposed and we have chosen i^T = 0, only a, (3^ and ij} remain as 
independent variables. We have already noted that (|2.160p provides an elliptic equation that fixes 
a on each slice, and we will display the final form of that equation momentarily. We now proceed 
to show that the constraints provide similar equations for -0 and (3^ . 

To do so, we first consider the divergence of A^^ . Using (|2.166p we calculate 



1 



bjA'^ - —b, ( f'bkp' + ^'''bkP' - ^f'bkP'' ] - -b,aA'^ 



1 



— ( ^^''DkD,l3' + -f^DkD.p^ - 2A'W,c 



(2.167) 



Using (|2.167p and (|2.166p in the momentum constraint (|2.163p and rearranging, we find 
1 



16^0" J' 



1 



f''DkD,l3^ + 



(2.168) 



which are 3 elliptic equations for the 3 shift vector components, /3'^. Next, we use (|2.165p and 
(|2.166p to rewrite the product AijA^^ in terms of covariant derivatives of the shift vector: 



AjA^' = — \^kjfD,p''DiP^ + DkP'D^P' - -D,P'D,(3^ 

Substituting the right hand side of this equation into the Hamiltonian constraint p.l62p and 
rearranging, we have 



^ ( 7fc,r'A/3"A/3^ + Dkl3W,^'' - =;D,PW,(3^ ) - 2ttp 



05 



(2.169) 



which is an elliptic equation for the conformal factor, ip. 
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Finally, (|2.160p . which we recall arises from the demand that K = can be written as 



TDkDia 



2f^bk (ln?A) bia + V/a ifcii^'' + 47r (5 + p) 



+ AniP^a (S* + p) , 



(2.170) 



and is, as stated previously, an elliptic equation for the lapse function, a. 

We emphasize that a sequence of solutions of equations (|2.168p - (|2.170p — computed on each 
hypersurface of the spacetime — completely determines the 4-dimensional metric, g^^. Specifically, 
we have 



Table ^lA\ summarizes the steps taken in this chapter to produce the CFA equations of motion. It 
includes enumerations of the full Einstein-Klein-Gordon equations within both the standard and 
conformal 3 + 1 approaches, as well as the subset of the conformal equations that were used to 
derive the CFA system. 

In concluding this section, we note that the system ()2.168p - ()2.170p is still written in a generally 
3-covariant form — that is, the equations are valid for any set of spatial coordinates that we might 
choose to adopt in conjunction with a flat 3-metric. In order to recast the system as a specific set 
of PDEs that can be solved numerically, we must first fix the spatial coordinates, and this is the 
topic of the next section. 



ds 



2 



g^pdx^dx'^ 

-a^df + {dx^ + 13' dt) {dx^ + 13'^ dt) 




(2.171) 



EKG cqns. in 3 + 1 form. 



Conformally decomposed EKG eqns. 



CEA equations of motion. 



n + K'^ - KijK'' = 16np (p:49)l 



DjK'^ - D'K = 8ivf ((230ll 



DjA'' + 6Dj (Ini/') A'' - ^D'K = Stt^^J' (pT^ 



(a* - £,3) ^ = -V (A/3' - oA-j (121^ 
{dt - Cp) = - 2aA,j - 1% 7^^^*= (12^511 
{dt ~Cp)K = - ^-^''^ [bkDia + 2bk (In V) Aq] 



'C{i-«V' = gV'A/?' (I2A581) 
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Table 2.1: Equations of motion. This table summarizes the steps taken in the derivation of the CEA equations of motion. The fully general 
relativistie 3 + 1 Einstein-Klein-Gordon (EKG) equations, and their conformal decompositions, are given in the first two columns. The first 
row lists the constraint equations, while the evolution equations for the geometric and matter variables are are listed in the second and third 
rows, respectively. The maximal slicing condition and assumption of conformal flatness yield the CEA equations; these are given in the third 
column. Note that within the CEA the components of the traeeless conformal extrinsic curvature components, Aij, are defined in terms of the 
lapse function and shift vector, and that the evolution equations for the conformal factor, t/;, and the Aij^ shown in red, are not used. 
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2.4.4 Specialization to Cartesian Coordinates 

The choice of spatial coordinates is frequently a crucial issue in numerical relativity calculations. 
In cases where the spacetinie is constrained to have an exact symmetry, or where the physical 
scenario has an approximate symmetry, coordinates adapted to the symmetry arc often used. 
Adopting such coordinates — which will generally be curvilinear — can be advantageous not only for 
the significant simplification in the equations of motion that may result, but also for providing 
a "natural" representation of the solution, which in turn can minimize computational cost. For 
example, consider the problem of simulating a single star in 3 spatial dimensions plus time, but 
where the star remains approximately spherically symmetric. Then, irrespective of the method we 
choose to discretize the continuum equations (i.e. finite difference, finite element, spectral etc.), we 
can expect that for specified accuracy it will cheaper to compute a solution using spherical polar 
coordinates than Cartesian coordinates. 

However, especially when dealing with tensor equations, the use of curvilinear coordinates 
for numerical calculations can also be problematic. Such coordinate systems generically have 
coordinate singularities, such as r = and the "z-axis" in the spherical polar case, where special 
attention and care is needed when designing and implementing the numerical approach to ensure 
that the computed solution remains smooth. In any case, the ultimate goal of the work described 
here is to simulate boson star binaries, where the expected solutions will not have any specific simple 
symmetry that would motivate us to adopt some special curvilinear coordinate system. We instead 
use a Cartesian coordinate system, {x,y,z), which has the advantages of 1) covering the spatial 
hypersurfaces in a smooth fashion (i.e. without any coordinate singularities or other pathologies) 
provided that there are no physical singularities on the slices, and 2) inducing particularly simple 
forms for the differential operators appearing in the governing PDEs. 

In addition to having Cartesian "topology" , and in contrast to some previous related work (most 
notably that of Wilson, Mathews, Marronetti HHHZlIIHllIllinillllllUlilllSDwe require that 
our coordinate system be "asymptotically incrtial", i.e. we demand that at large distances from 
the matter sources the metric components approach those of flat spacetime in an inertial frame, 
gfiv — > Vfiiy diag(— 1, 1, 1, 1). In the work of Wilson and collaborators the coordinate system was 
typically chosen to be in corotation with the binary system being studied. Finally, we emphasize 
that the topology of the hypersurfaces is taken to be K^, so that the slices are infinite in extent 
in all three directions. Naturally, this has significant implications for the numerical treatment of 
boundary conditions in our model as will be discussed in following sections. 

Thus, from this point on, the set of field variables defining our model system, namely a, ipi P^-i 
i = 1,2, 3, and (jiA, H^, A= 1,2, are all to be understood to be functions of (t, x, y, z). 

With the choice of Cartesian spatial coordinates, the conformally flat 3-dimensional line element 
is simply 

^^Us^ ^t/j'^idx^ +dy^ + dz^). (2.172) 

We can now display the equations of motion for our model in essentially the form used for our 
numerical computations. We start with the evolution equations for the complex scalar field, ()2.67p 
and (|2.68[) . which become 

dt(l)A = ^n^ + /3'9,0A, (2.173) 

dtUA = {P'-'Ua + aij^'^d^H) + dy {pmA + aiP''dy(j)A) 

+ 5,(/3^n^+aV'5,</)A) -aV'^^S^'/'A. (2.174) 

"00 
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Continuing, the energy-momentum quantities defined by (|2.7ip - (|2.76p are 

2 



1 



E 



^12 + ^4 
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2 

E 



(2.175) 
(2.176) 
(2.177) 
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^ = ^E 
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(<9a;(?!)A)^ + {dy4>AY + (S^l?!)^)^ 



- iv'%[/(02), 



(2.178) 



■ 1 



(ajj^^)- + {dy(i)AY + {d^(t>AY 



P + S = 



E 



2-^ 

^12 



-(7(</>g). 



- -(7(<^2)^ (2.179) 



(2.180) 



Next, we have the equations that constrain the geometric variables. The maximal slicing con- 
dition (|2.170p for the lapse function becomes 



d'^a d^a d'^a 
dx^ dy^ dz^ 



2 



dijj da dtfj da dtp da 
dx dx dy dy dz dz 



+ aV/ (a^,A'^ + 4^ (p -f- S)) , (2.181) 



while the Hamiltonian constraint (|2.169p for -0 is 

d'^ip d'^Tp d'^ip 



a-ip a-tp a-tp w ( ~. ji,- , 



dx^ dy"^ dz'^ 
For both (|2.18ip and (|2.182p the term A^jA'^ is given by 



(2.182) 
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Finally, from the momentum constraints (|2.168p , we have the 3 equations that fix the components 
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of the shift vector: 
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dP' 
dx 



The 4 hyperbohc scalar field evolution equations, (|2.173p and (|2.174p . along with the 5 elliptic 
equations, p.l8ip - (|2.185p . constitute the basic set of equations for our model. This set of PDEs 
must of course be supplemented by boundary and initial conditions in order to complete the 
mathematical prescription of the model, and these will be discussed in the sections that follow. 

Before moving on to that discussion though, it is worth mentioning that the derivation of 
equations of motion such as the above set is a non-trivial and error prone process. It is therefore 
very useful to use symbolic manipulation software to check calculations, and we have done so. 
Specifically, after having been derived by hand, the equations presented in this thesis were checked 
using Maple |145j . including a tensor manipulation package due to Choptuik [146] . 



2.4.5 Boundary Conditions 

In this section we discuss issues related to the boundary conditions that are to be applied in 
conjunction with the equations of motion summarized above. Mathematically, our model is to be 
solved as a Cauchy problem where the t = const, surfaces extend to spatial infinity. We will 
restrict attention to cases where the matter source (the complex scalar field) has compact support 
on the initial time-slice: the hyperbolicity of the scalar wave equation then guarantees that the 
matter can never reach spatial infinity, which, of course, is appropriate from a physical point of view. 
This restriction, combined with the demands that 1) the spacetimes we construct be asymptotically 
flat, 2) that our (x, y, z) coordinate system be "asymptotically inertial" , and 3) that coordinate 



^^We note that our use of the term "boundary conditions" hero is thus a sUght abuse of nomenclature in the sense 
that the t = const surfaces are edgeless, and thus have no boundaries. 
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time is identical to proper time at infinity, provides the following set of boundary conditions: 



lim 


4>A{t,x,y,z) = 0, 


(2.186) 


lim 

r— >-oo 


IiA{t,x,y,z) = 0, 


(2.187) 


lim 

r— ^oo 


ip{t,x,y,z) = 1 , 


(2.188) 


lim 


a{t,x,y,z) = 1 , 


(2.189) 


lim 

r— >oo 


/3'=(t,x,y,z) = 0. 


(2.190) 



Here and below r is defined by r = x'^ + y^ + z"^. 

The computational problems that arise from the fact that our boundary conditions are naturally 
specified at infinity are familiar ones, not only in numerical relativity, but in many other areas 
of numerical analysis and computational science that involve the solution of hyperbolic PDEs on 
unbounded domains. A key, if rather obvious observation, is that while the spatial domain is infinite, 
any specific numerical computation based on a discretization technique (such as finite differencing, 
as used in this thesis) must be restricted to a finite number of discrete unknowns. Given this, there 
are essentially two basic strategies for dealing with the infinite solution domain. The first involves 
artificially introducing boundaries at x — Xmin, x = Xmax, etc. and then imposing approximate 
boundary conditions on the solution unknowns. The second involves "compactification" of the 
infinite domain, using a coordinate transformation which maps infinity to a finite value of the 
transformed coordinate. The exact boundary conditions can then be applied on the boundaries 
of the compactified domain. In the current work, we have experimented with two variations of 
the first approach, and one of the second. We proceed to discuss each of the approaches that we 
studied, and in the the original order that they were investigated. We highlight specific challenges 
that we encountered, and indicate possible future directions for improvement. We also note that 
parts of the discussion rely on computational concepts and techniques that are discussed in detail 
in Chap. IH The reader may thus wish to postpone a detailed study of this part of the thesis until 
that chapter has been perused. 



Spatial Compactification 

Spatial compactification was the first strategy we considered for a computational treatment of the 
conditions ()2.186p - p.l90p . As already mentioned, the basic idea in this case is quite simple: a 
smooth coordinate transformation is applied to map each infinite coordinate range to a finite in- 
terval, which, without loss of generality we can take to be [—1, 1]. Since elements of the Jacobian 
matrix of the coordinate transformation will appear in the PDEs when written in the new coordi- 
nates, it is reasonable to require that the transformation be given in some simple closed form. With 
this in mind we chose the hyperbolic tangent function to define compactified coordinates (^, rj^ Q 

by 

£,{x) = tanhcc, ri{y) = tanhy, C,{z) = tanhz. (2.191) 

Clearly, this transformation maps — oo < x,y,z < oo to —1 < ^,77,^ < 1, as desired, and the 
exact (Dirichlet) boundary conditions can then be set at 7], ( ~ ±1. Transforming the PDEs that 
govern our model requires nothing more than the chain rule for differentiation. For our specific 
choice of compactification we have 
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and 



'2 



(2.193) 

as well as analogous formulae for the y and z derivatives. Using these formulae, we derive the 
transformed equations of motion simply by rewriting all of the derivatives appearing in (j2.173p - 
p.l74p and p.l8ip - (|2.185p . in terms of derivatives taken with respect to the compactified variables. 

We note at this point that spatial compactification of the sort we have just described generically 
leads to well-known computational problems for hyperbolic systems. In particular, for the case of 
finite differencing, and assuming that the discretization is uniform in the compactified coordinates 
(i.e. so that the spacing between grid points in each of the coordinate directions is constant), 
the physical separation between grid points becomes increasingly large as the boundaries of the 
compactified domain are approached. This means that waves propagating outwards invariably 
become very poorly resolved. In turn, this can lead to various difficulties including numerical 
instabilities and spurious reflections of waves back into the interior of the domain. 

Nonetheless, spatial compactification of this type has proven successful in some previous time 
dependent calculations in numerical relativity. This includes a study of the Gregory-Laflamme 
instability of black strings |147| . as well as Pretorius' ground-breaking work on binary black hole 
inspiral and merger [43l I148j . In both of these instances, computational difficulties of the sort 
mentioned above were kept under control through the addition of numerical dissipation. 

Given the previous discussion, as well as the fact that compactification is routinely used with 
great success in the numerical analysis of elliptic PDEs, it is somewhat ironic that it was problems 
associated with the solution of the elliptic equations of our model that ultimately forced us to 
abandon compactification. Noting that the specific difficulties we encountered are well documented 
in the numerical analysis literature, we nonetheless feel it important to discuss them here in some 
detail. We are especially motivated by the fact that there appear to be very few instances where 
elliptic systems have been solved in numerical relativity using compactification in conjunction with 
finite difference techniques. 

In order to understand the nature and source of the problem we ran into, it suffices to consider 
a model elliptic PDE, namely the Poisson equation: 

^9 / N / N d'^U d'^U 

Vuix,y,z) = p{x,y,z) ^ _ + _ + _=p. (2.194) 

Here u is the unknown function while p is a specified source function, which we will require to have 
compact support. In analogy with the elliptic equations of our model system we will consider the 
solution of (|2.194p on M.^^ and require that 

lim u[x,y,z) = 0, (2.195) 



where, again, r = \/x'^ + y"^ + z"^. We now transform p.l94p to the compactified coordinate system 
defined by (|2.19ip using p.l93|) and the corresponding formulae for the second y and z derivatives. 
We get 



m - - Ml 2C(1 Pit V, 0, (2.196) 

where u = u((,, rj, C) = u{x, y, z). The boundary conditions (j2.195[) now become 



u{i,7j, c) = s(~i, r,, c) = u{t 1, c) - «(e, -1, = s(e, V, 1) = ii(e, -i) = o (2.197) 
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that is, they are simply Dirichlet conditions imposed on the boundaries of the domain —1 < 
S.,VtC ^ Ij precisely as we have for the elliptic equations that govern the metric variables in our 
model when expressed in the rj, C) system. Now, clearly, the transformation to compactified 
coordinates increases the algebraic complexity of any elliptic PDE to which it is applied, and 
introduces additional terms involving first derivatives of the unknown. However, dealing with 
these complications within the context of a finite difference approach is not difficult in principle, 
and, at least naively, it seems reasonable to expect that the extra computational cost involved will 
be more than offset by the increase in accuracy and improved convergence properties that the use 
of exact Dirichlet conditions will provide. 

Unfortunately, these expectations are not met when one factors in the combination of 1) the 
specific technique that we have adopted to solve the discretized elliptic equations, and 2) the 
need to eventually have a code that runs in parallel on multi-processor architectures. We can 
summarize the situation as follows. As described in detail in Chap. HI we have chosen the multigrid 
method to solve our finite-differenced elliptic PDEs due to its efficiency: in particular, it is the 
only available technique that can solve discretized forms of general nonlinear elliptic systems using 
0{N) calculations, where N is the total number of discrete unknowns. However, as we will now 
discuss, the use of compactified coordinates induces a major stumbling block to the parallelization 
of the multigrid algorithm. 

Consider the following schematic form for a Poisson-like equation, where the coordinate system 
(x, y, z) may be some general curvilinear system such as 77, Q) 

O'^ii d'^u O^u 

^(^' + + '^(^' y^ = p(^' (2.198) 

From the coefficient functions A{x^ y, z), B(x, y, z) and C(x, y, z) we can construct the ratios 
|^/C| and \B/C\. If any of these ratios exhibit large variations in magnitude on the solution 
domain, then the PDE is said to be anisotropic. It is clear, then, that the compactified Poisson 
equation (|2.196p . is highly anisotropic in this sense, since each of the ratios |^/-B|, l^/C*] and \B/C\ 
ranges from to oo for — 1 < ^, 77, C < 1 

Now — and again as discussed in more detail in Chap. |3] — to construct an efficient multigrid 
algorithm, we must have a way to efficiently smooth the errors in the discrete solution independently 
of the size of the mesh spacing that is used in any given calculation. For equations that are 
not anisotropic, simple relaxation methods such as point-wise Newton-Gauss-Seidel (see App. |D]) 
tend to be very effective smoothers. Importantly, point-wise techniques are readily parallelized 
because the operation of updating any given unknown during a relaxation sweep requires only 
"local information" : that is the update operation only involves values of other unknowns which 
are directly coupled through the finite difference equations themselves. For the type of finite 
differencing used in this thesis (see App. |B|) this typically amounts to nearest-neighbours in each 
of the coordinate directions. 

For the case of anisotropic elliptic operators, however, point-wise relaxation will generically 
fail to be an effective smoother [3S]. For example, consider the situation where the ratios 
and |A/C| for the schematic equation (|2.198p satisfy « |^/C| ^ 1- Then the equation 

is anisotropic and is said to be strongly coupled in the x direction. As shown in |35j . point- wise 
relaxation tends to smooth only along directions of strong coupling, so in this instance, the point- 
wise approach will not provide effective smoothing along the y and z directions. Now, there are 
well known strategies for recovering a good smoother when facing anisotropy. Chief among these 
is the use of line relaxation, which for the current example would involve the simultaneous update 
of all of the unknowns in the x direction for each discrete combination of {y, z). Unfortunately, line 
relaxation destroys the locality of the smoothing process, and thus inhibits parallelization. This is 
the main reason that we decided not to implement compactification in the version of our code that 
was used to generate the results described in this thesis. 

Before concluding this section, however, we point out that there is another approach to deal 
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with anisotropy in multigrid that seems very promising for our apphcation. This is the technique of 
semi-coarsening [1491 1150] , whereby the coarsening operation inherent to aU multigrid algorithms 
is first performed only along directions of strong coupling. The coarsening process tends to weaken 
the degree of coupling in those directions, so that point-relaxation can again be used as a smoother. 
For compactifying transformations of the form (|2.19ip the effective strong-coupling directions will 
be location dependent, so that we would need to implement an algorithm that cycles through the 
three coordinates ^, rj and Ci semi-coarsening along each direction in turn. However, all of the 
operations involved in this method, including the semi-coarsening itself would remain local, and 
thus the method would be amenable to parallelization. 



Asymptotic and Sommerfeld Boundary Conditions 

The next strategy we considered for treating the boundary conditions (|2.186p - (|2.190|) is the one 
that is probably the most widely used in current 3D numerical relativity codes. Here, the underlying 
idea is to work in (non-compactified) (x, y, z) coordinates and replace the spatially infinite domain 
with a finite region defined by 



< X < x„ 



2/min <y<yn 



< z < z„ 



(2.199) 



where 



max; '^min; -2'max; 



become adjustable parameters of the computation. We 



then use the known and/or expected behaviour of the solution unknowns as r — )- oo |120| ^ to 
derive approximate conditions which can be imposed on the boundaries of the domain. 

For example, considering the conformal factor, ip, asymptotic fiatness of the spacetime requires 
that 

lim V==l + - + 0(r"2), (2.200) 

where fc is a constant. Using a trick that is well known to numerical relativists |151| . we can 
convert (|2.200p to a boundary condition of mixed type as follows. We first differentiate (|2.200p 
with respect to r to get 



lim^ = -A + 0(.-). 



This implies that 



k = lim 



dr 



+ 0(r-i). 



Using this result to eliminate k in (j2.200p we have 



(2.201) 
(2.202) 



lim V = l-r^+0(r-2) 

r— ^■oo OT 



(2.203) 



lim 



dr r 



= Oir 



(2.204) 



Neglecting the 0{r term in the above, the condition that we impose on tp on the boundaries of 
the finite domain is then 

^ + t^ = Q. (2.205) 
or r 

In terms of Cartesian coordinates, {x,y, z), we have 



dip dip dx dip dy dip dz x dip y dip z dip 
dr dx dr dy dr dz dr r dx r dy r dz ' 



^ Again recall that r = y^x^ + y^ + z^ . 
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where the relations dx/dr = x/r etc. fohow from the standard transformation from rectangular to 
spherical polar coordinates. Using the above result in (|2.205p we have 



d-ip 9-0 dij] 



dx 



y 



dy 



dz 



(2.206) 



Similar boundary conditions of mixed type can be derived for the lapse function, a, and the 
shift vector components, (3^ , from the asymptotic behaviours: 



lim a = 1 + 0(r"i) , 



(2.207) 



lim = 0{r' 



(2.208) 



For the case of the scalar field variables, (pA and Ha, we also have fall-off conditions, but 
additionally must take into account the radiative nature of the fields. The situation is further 
complicated by the fact that since the field is massive, it has a non-trivial dispersion relationship. 
The simplest — and fairly crude — approach is to treat the scalar field as if it were massless. Letting 
v(t,r) denote any of the (j)A, in the asymptotic region r — > oo, we then expect 



lim u(t, r) 



h{t - r) 



(2.209) 



where /i is a function that describes a purely outgoing disturbance propagating at the speed of 
light (i.e. with speed c = 1 in our units). If we impose ()2.209|) as a boundary condition, then we 
arc requiring that there be no incoming radiation (from infinity) at any time during the evolution: 
such a demand is often called a Sommerfeld boundary condition. 

The Sommerfeld condition in the form (j2.209p cannot be implemented directly in a numerical 
calculation since h is not a known function. However, observing that (|2.209p implies 



lim (rv) ~ h{t 

r^oo 



(2.210) 



we immediately have 



or 
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r— >cx3 
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- [rv) + - irv) 



0, 
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r-^ + r— + V 
at or 



In Cartesian coordinates, and dropping the lim.r_ 



= 0. 
this becomes 



■rdv 



dv dv 
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Z^— h X— h J/t^ h Z— = 
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(2.211) 
(2.212) 

(2.213) 



The remainder of this section concerns technical details and issues related to our numerical 
implementation of boundary conditions exemplified by (|2.206p for the metric variables, and (|2.213p 
for the scalar field quantities. As such, we again note that the reader may first wish to con- 
sult Chap. (|H) as well as App. ([B]) for background information on the specific finite differencing 
techniques that we used to discretize our model. 

The key piece of (|2.213p on which we need to focus is 



dv 
dx 



dv dv 
dy 



dz 



(2.214) 



^^We note that a mixed boundary condition such as l|2.206|l . which involves the value of the function and its 
normal derivative, is sometimes called a Robin condition. 
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Observe that an expression of identical form appears in the boundary condition (|2.206p for the 
conformal factor, ?/;, and also appears in the corresponding equations for a and that we have 
not displayed here. Denoting the coordinates of the finite difference grid by 

{xi,yj,Zk), i = l...n^, j = l...ny, fc = 1 . . . (2.215) 

(see Sec. l4.ip . we consider the boundary x ~ Xmin which is a; = xi on the grid. Noting that our grid 
uses the same (constant) mesh spacing, h, in each of the coordinate directions, our finite difference 
version of (|2.214p is 

V2J,k - Vij,k 1 f Wl.j + l.fc - Vl.j-l,k , V2,j+l.k - V2,j-l.k 

"^^/^ h + 2^-'l 2h + 2/1 

1 / Vl.j,k+1 — Vl,j.k-1 V2.j,k+1 — V2.j,k-1 

+ 2^n 2/1 + 2h 

j ^2...ny~l, fc==2...n^-l (2.216) 

Note that this formula is centred at the "fictitious" grid point {x^/2-, yj,Zk) = {{xi + X2)/2, yj, z^), 
and that due to this centring the approximation of the x derivative is 0(h?) while only involving 
grid function values at xi and X2- Similar formulae are readily derived for the other boundaries 
X = a:max, y = J/min: V = 2/max, z = z^in and z = Zmax- Additionally, we stress that expressions 
such as (j2.216p cannot be used along the edges or at the corners of the grid. There we use modified 
versions of (|2.216p that maintain the basic strategy of centring the formula at a fictitious grid 
point (defined as the centre of 4 points along the edges, and 8 points at the corners). Wc note 
that although we expect that this differencing technique should work adequately for the scalar 
field variables (maintaining stable evolution in particular) the current version of our code has some 
convergence problems when stringent tolerances are set for the overall time-step iteration (see 
Sec. 15. ip . We believe that this is due to one or more code bugs, but have not invested much time 
on the matter given the issue with the elliptic equations that we now discuss. 

Indeed, use of the same differencing scheme to discretize the boundary conditions — such as (|2.206p - 
imposed on the elliptic PDEs led to severe numerical difficulties that we have yet to resolve. Once 
more the issue is related to our use of multigrid to solve the elliptic equations. Even though our ba- 
sic relaxation method (used in multigrid as a smoother, as discussed briefly in the previous section) 
converged to a smooth solution when applied to the full set of difference equations for the elliptic 
unknowns, our multigrid algorithm would not converge. We strongly suspect that the multigrid 
failure can be traced to our use of the particular differencing strategy for the boundary conditions 
that we have outlined above. Heuristically, the use of centring at the fictitious points means that 
the boundary conditions arc actually being imposed in the interior of the domain, rather than on 
the boundaries per se. During the execution of a multigrid cycle, as coarser and coarser grids come 
into play, the locations where the boundary conditions are set thus penetrate further and further 
into the solution domain, and this seems to destroy the convergence of the method. 

One possible remedy for this problem would be to augment the grid with so-called ghost points, 
and then to impose discretized versions of the interior PDEs as well as the boundary conditions 
at the boundary points. Typically, when this approach is adopted one then eliminates the ghost 
values that are referenced by the interior equations using the discrete boundary conditions. How- 
ever, although this strategy works well in ID and 2D, it is not so straightforward to implement 
in 3D, especially when the PDEs involve mixed derivatives as is the case for our system. Addi- 
tional complications arise due to the need to maintain smoothness near the boundaries during the 
multigrid process, as well as in formulating appropriate transfers within the grid hierarchy that are 
compatible with the boundary conditions. 

In brief, although we devoted considerable effort to the task, we were not able to construct a 
convergent multigrid solver for our elliptic equations in the case that discrete versions of asymptotic 
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boundary conditions such as (j2.206p were used. However, we remain hopeful that a resolution to 
this problem does exist, and plan to continue to work towards it. 



Dirichlet Boundary Conditions 

Our third approach to the numerical treatment of the boundary conditions for our model, and the 
one that was incorporated in the code used to generate all of the results presented in Chap. [5j 
is extremely simple. We simply set the conditions (|2.186p - (|2.190|) on the boundaries of the finite 
domain defined in the previous section: 

•^min — — -^max ; ^min — V — ^max ; -^-min ^ ^ ^ ^max : (2.217) 

Specifically, we have 

U{t,Xniin,y,z) ^U(t,X^i,^,y,z) ^ 1 

U {t, X, ymin, z)^U (i, X, y„iax, z) = 1 

U{t,x,y,z„-,in) ^ U{t,x,y,z-^^^) = 1 (2.218) 
where U represents a or and 

V(t,Xmin,y,z) = V{t,Xmi,^,y,z) = 

V{t, X, J/min, z) = V{t, X, J/max, z) = 

)^V{t,x,y,z^,^) = (2.219) 

where V is any of the (3'', (pA or 11^. With this choice, we were able to construct a convergent 
multigrid solver for the elliptic equations, and, as described in Chap. [S] our numerical evolutions of 
the coupled elliptic-hyperbolic system of PDEs governing our model were stable and convergent. 

Although this choice certainly constitutes quite a crude approximation, especially for the domain 
sizes that we have used in our calculations to date, we note that we ultimately intend to incorporate 
adaptive mesh refinement techniques |152| into our code. Particularly for scenarios involving the 
interaction of two boson stars, we anticipate that fine grids will be needed only in the central region 
where the stars interact. The hope is then that a "telescoping" sequence of ever coarser grids will 
allow us to increase the physical size of the numerical solution domain to the point where the use 
of Dirichlet conditions contributes a relatively minor amount to the overall error in the solutions. 
Again, however, this is a matter for future investigation. 



2.4.6 ADM/York Mass 

In the Hamiltonian formulation of general relativity, the total spacetime energy — also called the 
ADM mass — is associated with the numerical value of the Hamiltonian, which at any time, t, is 
to be computed on a surface, dV, at spatial infinity. The surface encloses the entire volume V of 
the spacelike hypersurface Et . For an asymptotically flat spacetime, the total energy on Ei can be 
expressed as a surface integral involving derivatives of the components of the 3-metric at spatial 
infinity. In terms of the 3 + 1 variables and Cartesian coordinates, this integral is 

Madm ^E^^ = -^\un I (^-^) N^dS . (2.220) 

Here r = \/a;^ + + , Hoo is the numerical value of the Hamiltonian at spatial infinity, 

is a outwards-directed unit vector normal to the surface dV, and dS is the area element on the 

surface. That is, dS = yfqd^y, where q is the determinant of the induced metric on dV and d^y 
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are differentials associated with the coordinates on the surface. For example, if dV is a spherical 
surface at i^, then adopting the usual spherical polar coordinates, {r,0,(j)) the measure is simply 
^5* = ^cPy = r^smOdOdct) and the unit normal, N\ has components (TV, TV*, iV^) = (1,0,0). 

In situations such as ours, where the outer boundaries of the computational domain are not 
in the vicinity of spatial infinity, it is problematic to use (j2.220p to compute a good estimate for 
the ADM mass. Fortunately there is an alternate expression for Madm due to O Murchadha and 
York [153j that provides the basis for a more accurate calculation of the mass in our simulations. 
O Murchadha and York investigated the relationship between the ADM masses of two spatial 
metrics that were conformally related, i.e. for 7,^ = tp^jij- Starting from (|2.220p . they showed that 
the difference in energy associated with the two metrics is given by the following volume integral 
over the spacelike hypersurface: 

Wtt (Madm - Madm) = {f'd.d,^) d'x. (2.221) 

This expression takes a particularly convenient form when the conformal metric is flat, since in 
that case the conformal ADM mass, Madm, vanishes and we haveF^ 

IGttMadm = -8 / V7(f' 9,9ji^) d\. (2.222) 
Jv 

Written in Cartesian coordinates this becomes 

Madm = {i\xx + i^,yy + ^,zz) d^x . (2.223) 
27r Jv 

In the simulations of our model problem we evaluate the above volume integral numerically, and 
use the value of Madm computed in this way as a diagnostic quantity. In this regard we mention 
that the question of whether Madm is exactly conserved in our model remains, to our knowledge, 
an open one. It is thus noteworthy that our simulations to date suggest that it is a conserved 
quantity. However in the absence of a theorem to this effect, we emphasize that our use of the 
conservation of A/adm during an evolution as a measure of the accuracy of the calculation should 
be viewed with caution. 



Murchadha and York thus noted that this result provided an alternate definition of Madm that was valid for 
for any asymptotically conformally flat hypersurface. 
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2.5 Overview of the Equations of Motion 

In previous sections we have discussed the derivation of the PDEs that govern our model in con- 
siderable detail. Here we provide a recap of that development and redisplay the equations in a 
schematic form that emphasizes the principal parts of the various differential operators involved. 
In addition, we provide a brief commentary concerning some of the properties of the PDEs as well 
as few details about how they were solved numerically. We also note that we use a standard tensor 
calculus notation for partial differentiation below: namely that f^x = df/dx, f^xx = d^f/dx^ etc. 

In summary, our equations of motion constitute a mixed elliptic-hyperbolic system of 9 nonlinear 
PDEs. There are 5 quasi-linear, elliptic equations for the geometric variables, which include the 
lapse function, a, the conformal factor, tp, and the three components of the shift vector, In 
addition, there are 4 quasi-linear, hyperbolic equations which govern the components of the complex 
scalar field, 01 and (/)2, and their conjugate momenta. Hi and 112. 

The elliptic equations were derived as follows. The maximal slicing condition for a resulted 
from taking the trace of the evolution equation for the extrinsic curvature K'j, and demanding 
that the right hand side vanish. Following some manipulation (including the use of the Hamilto- 
nian constraint to eliminate the 3-Ricci scalar) this led to equation (|2.181|) which can be written 
schematically as: 

+ + = (a, j-,0A,nA) . (2.224) 

Here Na is a function which is nonlinear in many of its arguments, including the first-order deriva- 
tives of a, ip and 13''. 

The elliptic equation (|2.182p that governs the conformal factor, -0, came from the Hamiltonian 
constraint and has the form: 

'ip.xx +1p,yy +'ip.zz = (tt, "0, ;0 ^ , 0A, j , H^) , (2.225) 

where N.^ is again nonlinear, but in this case, the key derivative nonlinearities involve only the 
shift vector components. 

Additionally, the momentum constraints were used to derive (|2.183p - (|2.185p . which arc elliptic 
equations for the components of the shift vector (3'^ = (/3^, /3^, /3^). These take the form 

+ IS^yy + + I {P',yx + P^x) = ^ p. («, V, V',,, J , , /3^, , , , , H^ ) , 

(2.226) 
(2.227) 

+ P',yy + \P^z + \ + ii^.y.) = ^ V J , /?^, Z?^, '/'A,. , H^) . 

(2.228) 

Here, the right-hand-side functions iV^i , TV^h and A^^^ , although generally nonlinear, are linear in 
the first order derivatives of the fi' . 

These five elliptic PDEs have to be supplemented by boundary conditions. As discussed in 
Sec. I2.4.5[ for all of the results reported in this thesis, we used Dirichlet boundary conditions as 
follows 

a|B = l, V'|b = 1, ni3=0, /3^|b=0, /31b - 0. (2.229) 

where the notation /|b means "/ evaluated on the boundary of the (finite) solution domain". 

Numerically, and as discussed in more detail in Chap. |4] and App.lBj the elliptic equations were 
discretized using second order finite difference methods, on a uniform mesh (constant spacing h in 
all three of the coordinate directions). The resulting discrete equations were then solved using a 
multigrid algorithm which is described in Sec. 14.31 

The hyperbolic part of our model consists of the four first-order- in-time PDEs (|2.173p - (|2.174p . 
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all of which originate from the general relativistic Klein-Gordon equation for a complex scalar field, 
(j) — (pi + i(j)2, where <j)i and (j)2 are real- valued. These equations are of the form 

</)A =iV^., (a,V',/3\'/'A,j,n^) (2.230) 

and 

Ha =^^n^ {oi,a^j,%p,^^j,|3\|3\^,(j)A,(t)A,j,(j)A,jj,^A,^A.,j) , (2.231) 

for A = 1,2, and where YIa are the momenta conjugate to 0^. In this case we observe that the 
source functions N^j^ and iVn^ arc linear in the scalar field quantities themselves. As with the 
elliptic equations, we impose Dirichlet boundary conditions on the hyperbolic variables: 

(0a)|b=O and (n^)|B== 0. (2.232) 

The hyperbolic equations were also discretized via second order finite difference techniques, 
and on the same grid used for the elliptic PDEs. More specifically, we used an implicit Crank- 
Nicholson scheme (Sec. 14.1.31 for details), and the resulting algebraic equations were solved with a 
point-wise Newton-Gauss-Seidel iteration (App.lPjfor details). Again, Chap.|4]and App.lBl provide 
many more details concerning all of the major numerical techniques and algorithms we used in the 
current work. 

The reader will note that we have not yet discussed the issue of initial data for our model 
which, of course, is an absolutely crucial part of the complete specification of any initial-boundary 
value problem. Modulo potential difficulties due to the nonlinearity of the equations, we can, in 
principle, specify arbitrary values for the scalar fields, 0^, and the conjugate momenta, 11^, as long 
as those values are sufficiently differentiable,[^ and compatible with the boundary conditions. The 
elliptic equations can then be solved to determine the initial values for the metric functions. The 
future (or past) development of the initial data will then be given by the solution of the coupled 
elliptic-hyperbolic system, subject to the boundary conditions. 

However, our principal aim is to study the interaction of boson stars, which, we recall, are 
localized, gravitationally bound configurations of the scalar field that, in isolation, produce time- 
independent gravitational fields. The process of determining initial data for even a single such star 
is a non-trivial matter: generating initial states for binaries adds a few more complications. We 
thus devote the entire next chapter to the issue of computing this specific class of initial data. 



^This assumes that the scalar field potential has only a mass term, which is the case for the calculations reported 
here. 

■^''Mathematically, it would suflice to have initial data that is twice diflferentiable, but especially given that we are 
using centred finite difi'erence techniques, it will be more convenient to require the data to be smooth — i.e. infinitely 
differentiable. 
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Chapter 3 
Initial Data 



In this chapter we describe the computation of initial configurations for our model that contain 
boson stars. Although we are most interested in the case of initial data describing boson star 
binaries, the problem of determining data for a single star is interesting in its own right, and, of 
course, forms the basis for the task of constructing an initial state for two stars. To generate data 
for one star, we adopt a particular ansatz, detailed in Sec. 13.41 in which both the spacetime and the 
scalar field, </>, are spherically symmetric, and where due to an assumed harmonic time-dependence 
for (p, the spacetime is time independent. 

Now, it transpires that for spherically symmetric spacetimes, the requirement of conformal 3- 
flatness does not imply that we are considering an approximation of the full Einstein equations, 
as it does in the general 3D case. Rather, as we will discuss in Sec. 13.21 in spherical symmetry 
conformal flatness amounts to a specific choice of radial coordinate, r. that we can always make, 
and which we can further combine with maximal slicing to completely fix the spherical coordinate 
system. For this reason the first five sections below treat the full Einstein equations coupled to a 
complex scalar field (with some general self- interaction potential), and we thus begin in Sec. 13.11 
with a review of the 3 + 1 equations for spherically symmetric spacetimes. Additionally, since it is 
easiest to determine the boson star solutions using a so-called areal radial coordinate, R, and then 
transform the solutions to the coordinate r in which the 3- metric is conformally flat, we discuss the 
relevant Einstein equations in both coordinate systems in Sees. 13.31 and 13.21 Sec. 13.41 then defines 
the ansatz used to generate a single boson star. This reduces the field equations to a system of 
ordinary differential equations ODEs for the scalar field and metric variables, which is further an 
eigenvalue problem. We briefly discuss how the system is solved, and then in Sec. 13.51 comment on 
some of the properties of the solutions. 

The last two sections of the chapter then return focus to the 3D case, and the use of the results 
from the spherically symmetric general rclativistic calculations to provide initial conditions for 
our model. Sec. 13.61 describes the straightforward process that we use to set up initial data in 
Cartesian coordinates for one or two stars, and Sec. 13.71 details the further transformations of the 
field quantities that are used to give stars non-vanishing velocities at the initial time. 

We emphasize that apart from the material in Sees. [3^ and [3771 none of what is discussed here 
is original to this work. Indeed, general rclativistic spherically symmetric boson stars have been 
studied by a large number of authors, so there is a considerable literature on the subject (see |119] 
for a review). In particular, much of the presentation in Sees. 13. l l H3. 51 follows Chap. 4 of Lai's PhD 
thesis [128j (which the interested reader can consult for additional details) as well as unpublished 
lecture notes due to Choptuik |154) . 

3.1 Spherically Symmetric Spacetime 

In general relativity, spherical symmetry can be precisely defined in terms of the symmetry group, 
S'0(3), whose group orbits are 2-spheres, and which, physically, is associated with rotations in three 
dimensional space |120| . In particular, spherical symmetry dictates that the spacetime metric, gab, 
be invariant under the action of 5*0(3). Choosing a set of coordinates {t,r,6,(j)) which is adapted 
to the symmetry, the metric induced on any 2 sphere (i.e. any collection of events defined by 
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t = const., r = const.), is some multiple of the metric on the unit 2-sphere, which itself is given by 

dn^ = de^ + sin^ ed(i)^ . (3.1) 

It is straightforward to argue that a completely general form for the line element on a t = const 
hypersurface in spherical symmetry is then 



(3)ds2 = a^it, r) dr^ + r'^b'^{t, r) dfl^ 



(3.2) 



where we stress that the functions a and b depend only on t and r. Similarly, it can be readily 
established that the most general line element for spherically spacetime can be written in the 3+1 
form 

ds^ = (-a(<, rf + a^(3{t, rf) dt^ + 2a^fi dtdr + a^dr^ + r'^b^dif. (3.3) 

Here, a[t^ r) is the lapse function as usual, and r) is the single non-vanishing component of the 
shift vector /J*^ = (/J'', ^'^) = (/3,0,0). 

Additionally, it is not hard to prove from ()2.44p that the extrinsic curvature corresponding to 
the spatial metric, 7^ , has only two independent components in spherical symmetry, K^r and g: 



K'j = diag{K\it, r), K%{t, r), K%{t, r)). 



(3.4) 



Before writing down the 3 + 1 Einstein equations for spherically symmetric spacetimes, it is 
convenient to introduce auxiliary fields, r) and n(<, r), which are defined in terms of the 
complex scalar field, </> as follows: 



$(i,r) = 0'(t,r) = 9,(/)(t,r) 



and 



-(0-/30'). 



(3.5) 



(Here and throughout this chapter we use the overdot and prime notations for partial differentiation 
with respect to the time and radial coordinates respectively.) Ell In terms of these auxiliary 
variables, and using ()2.40|) - (|2.43|) . the non- vanishing components of the stress-energy tensor are: 
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Jr 



H*$ + n$* 
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IHP - |$P 
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2a' 



where J7(|0p) is the scalar field's interaction potential. The spherically symmetric Klein Gordon 
equation gives us the following first-order-in-time evolution equations for the scalar field variables, 
0, $ and H: 



= -H + /3$, 



rr 



{rbf (/3H + - 
V a 



(rb) 



a/Ye - 13 



{rby 



n — aa 



(3.6) 
(3.7) 
(3.8) 



^^As a technical note, we point out that 11 as defined in l|3.5|l is not the conjugate momentum of the field if>; but 
that r^b^ sin(S)n is. The definition 13.51 1 is motivated by the form of the dynamical equations for <& and 11 that 
result (vis. (IgTt-lSTSt). 
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Continuing, from (|2.44p we have evolution equations for the metric functions a and b: 

d = -aaK''r + {a(3)' , 

r 

and from (j2.45p . evolution equations for the extrinsic curvature components, K^r and Kg 
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+ aKK\ - ^naU{\(l>\^). 



(rb)^ a{rb)'^ 

Turning to the constraint equations, the Hamiltonian constraint (j2.49p reduces to 

TZ + AK\K% + 2K\'^ = 167rp , 
which, using the explicit form of the 3-Ricci scalar, TZ, becomes 

' '{rby 



(3.9) 
(3.10) 

(3.11) 
(3.12) 

(3.13) 
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Finally the momentum constraint (|2.50p is 

(rb)' 



rb 



{K\ - K\) ^ 2^ 



n*$ + n$* 



a 



(3.14) 
(3.15) 



We have now displayed a complete set of equations for the Einstein-Klein-Gordon system in 
spherical symmetry, which is valid for any system of coordinates (i, r, 0, ^) that are adapted to 
the symmetry. In the next two sections we specialize these equations to the cases of two specific 
coordinate systems that bear on our current work. 



3.2 Maximal-Isotropic Coordinates 



As we discussed in Sec. 12.41 the choice of maximal slicing fixes the time coordinate by demanding 
that the trace of the extrinsic curvature, K = K^i, vanish at all times. To implement this condition 
in spherical symmetry then, we require K{Q, r) = and K{t, r) for all t and r. We note that since 
we have K''j = {K^r, K^g, K^g), and thus K = K''' r + 2K^ g, the choice K = allows us to eliminate 
one of K^r or K^g from the overall set of equations. In particular, we will take 



1 



(3.16) 



in the following. The specification of the coordinate system is completed by fixing the radial 
coordinate. We do this by requiring that the 3- metric be conformally flat, so that again introducing 
the conformal factor, 'ip{t, r), as in Sec. 12.4.21 we have 



(3.17) 



We reemphasize the point made earlier in this chapter that in spherical symmetry conformal flatness 
amounts to a coordinate choice: we are always free to write the 3-metric in the form p.l7p . and 
it does not imply that we are approximating Einstein's equations in some way, as it does when 
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applied to the generic 3D case. 
In terms of the general form 



for the spherically symmetric line element, p.l7p implies 

a = b = i''^{t,r). (3.18) 



Traditionally, the radial coordinate implied by p.l7p . or equivalently (jS.lSp . has been termed 
isotropic, and, although we will also use that terminology, the reader should keep in mind that 
"isotropic" is synonymous with "conformally flat" in this context. 

Operationally, to implement the isotropic condition p.l8|) . we require that it holds on the initial 
hypersurfacc, so that a(0,r) = 6(0, r), and that d{t,r) — b{t,r) for all t and r. Equating the right 
hand sides of p.9|) and (|3.10p and using a{t,r) = b{t,r), we easily derive the following ODE for 
the shift vector component, /3: 



r 



or, using p.l6p . 



(3.19) 



(3.20) 



Returning to the maximal slicing condition, wc have K = Q and K = Q. We thus take the trace 
of (|2.45p and equate the right hand side of the resulting equation to 0. We then use the Hamiltonian 
constraint to eliminate the 3-Ricci scalar, TZ, appearing in the equation in favour of terms involving 
the extrinsic curvature and stress-energy components, and perform additional simplifications using 
A' = and e = —K'^rl'^- After some manipulation we find the following equation for the lapse: 



(rhf 



{rb) 

which using a ~ b ^ , becomes 



47ra2i7(|0n - 87r|n|2 - j^a'^K\'^ 



a = 0. 



d 
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47rV'''C/(|0n - 87r|n| 



2^ 



a = 0. 



(3.21) 



Here we note that the operator d/dr^ takes a derivative with respect to r^. Its use here is mo- 
tivated by the need to maintain regularity in numerical solutions of p.21[) . and is an example 
of a general technique first introduced to the numerical relativity community by Evans |155) . 
The basic observation is that as r — >• 0, we have ip{t,r) ipoit) + r^ip2{r) + 0{r'^), and thus 
r'^4'{t,r) — >■ r^'0o(i) + 0{r'^). Since the leading order term goes like r^, finite difference operators 
based on d/d{r'^) rather than d/dr tend to produce numerical solutions that arc smoother near 
r = 0. This issue is discussed in more detail in App. E Sec. El 

Eqs. ((3?2T1) and ((3?20)) determine the kincmatical geometrical variables, a and /3, respectively. 
We now need equations for the dynamical geometrical variables, namely the conformal factor, 
■0, and the extrinsic curvature component, K^r- Here, due to the overdetermined nature of the 
Einstein equations, which, we recall, results from the coordinate invariance of the theory, we have 
two choices for each function. We can either use evolution equations derived from p.9p - p.lip . or we 
can use the constraint equations (|3.14p and (|3.15p . Here we choose the latter option, which produces 
what is known as a fully constrained scheme [122] . Specifically the Hamiltonian constraint p.l4p 
becomes an ODE for ip: 
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(3.22) 
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while the momentum constraint p.lSp provides an ODE for A'',.: 



47r 



— (n*$ + n$*) 



(3.23) 



We note that our abihty to completely determine the geometric variables without the explicit use 
of any evolution equations (and, in particular, without using any equations of the form K^j = ■ ■ ■) 
is a reflection of the fact that, in spherical symmetry, the general relativistic gravitational field 
has no independent dynamics: genuine time dependence of the metric must result from time 
dependence of a matter field that is coupled to gravity. 

Finally, the specialization of p.6p - p.8p to maximal- isotropic coordinates yields the following 
set of evolution equations for the scalar field variables: 



n = — r 



3 d 
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1 Kr ^.(r^b'y 



n — aip' 



(3.24) 
(3.25) 
(3.26) 



The set of equations p.20p - p.26p fully determine the spherically symmetric Einstcin-Klein- 
Gordon system in maximal-isotropic coordinates. Note, however, that these equations must be 
supplemented with initial values and boundary conditions in order to generate a unique solution. 
However, we have not used this system to generate any of the results discussed below, and have 
included the derivation of the equations of motion primarily for the sake of completeness. We 
will thus conclude our discussion of this particular coordinate choice at this point and refer the 
interested reader to App. B of Lai's thesis |128| for details of a numerical implementation of p.20[) - 
p.26p . When we return to the maximal-isotropic coordinate system in Sec. 13.41 it will be in the 
context of the initial data problem. 



3.3 Polar-Areal Coordinates 

The second coordinate system for spherically symmetric spacetimes that we consider also uses a 
condition on the extrinsic curvature to fix the time coordinate. In this case we demand that 

K = IC, = K'^r , (3.27) 

where we note that we use T and R in this section to denote the time and radial coordinates, 
respectively, with ii = du/dT and u' = du/dR. Eq. p.27p is known as the polar slicing condi- 
tion [156| , and, as for the choice of maximal slicing, has the immediate consequence of reducing the 
number of independent components of the extrinsic curvature. From the definition of K and p.27p 
we have 

K = K"r + 2K\ = K'^R , (3.28) 

which implies 

K\=0. (3.29) 

This last result is then used to derive an equation that must be satisfied by the lapse function, 
a{T,R), at any time T. Specifically, we require that the initial data satisfy K^g{Q,R) ~ 0, and 

^■^ As opposed to apparent time dependence arising from a choice of coordinates, which can in principle be identified 
by finding a solution of the Killing equations C^gab = 0, where ^ is the sought-for timelike Killing vector (sec Sec. 13. 4)1 . 
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then impose k%(T, i?) = for all T and R: 

K\{T, R) = k%{T, R) = 0. (3.30) 

As the name "polar-areal" suggests, the spatial coordinate is fixed by demanding that R directly 
measure the proper surface area of any T = const., R = const. 2-sphere. Now, in terms of our 
general form p.2p for the 3-metric, this area, A{T, R), is given by 

A{T, R) = At:R%{T, Rf , (3.31) 

so if R is to be areal, we must have &(T, R) = 1. Thus, the choice of radial coordinate eliminates 
another of the geometric dynamical variables from the system. Moreover, the general evolution 
equation (|3.10p reads 

b = -abK\ + 4 {Rb)' (3.32) 
R 

in the current case. Parallelling the implementation of the other coordinate conditions discussed 
thus far, to enforce 6(T, R) = I we require &(0, i?) = 1 at T = 0, and &(T, i?) = for all T and R. 
Using these relations, as well as K^g{T,R) = in (|3.32p . we find that the shift vector component, 
/3, must identically vanish. Thus the spacetime line-element in polar-areal coordinates is simply 

ds^ -a2(T, R)dT^ + a^{T, R)dR^ + R^d^f , (3.33) 

which can be viewed as a natural generalization of the familiar Schwarzschild line element 

ds^ ^-(^-'^^dT'+{l-'^^ 'dR^+R'dn^ (3.34) 

to the case of time dependent spherical geometries. We further note that the elimination of three 
of the geometric quantities — namely &, K^g and f3 — using two coordinate conditions is a special 
feature of the polar-areal system. 

Not surprisingly, given the simple (and diagonal) form of the line element (j3.33p . the resulting 
3-1-1 Einstein equations also assume a very simple form. Once more, since only two dynami- 
cal geometrical variables remain — the metric function, a, and the extrinsic curvature component, 
K^jf — we can use the constraint equations in lieu of evolution equations to determine them. 

Using 6=1 and K^e = in the general form of the Hamiltonian constraint p.l4p . we find that 
a must satisfy the ODE 

a' = i (1 - a^) + 47TRa + jnp + a'um^)] } , (3.35) 

while the momentum constraint p.l5p provides an algebraic equation for r: 

= _^(n*$ + n$*) . (3.36) 

a 

As discussed above, the polar slicing condition requires that K^e{T, R) ~ and K^e{T, R) = 0. 
Setting the right hand side of p.l2p to 0, and using K = 0, f3 = and b = 1, we derive the following 
ODE for the lapse: 

, a ( — 



— + Ar^R + |n|2 - a'um')] \ ■ (3-37) 
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Finally, for the scalar field variables, using b ~ 1, K^e ~ and /3 = in p.6l) - (|3.8l) we find: 



-n. 



n 



a(i?3) 



a 



(3.38) 
(3.39) 
(3.40) 



The set (|3.35p - p.40p eonstitutes a sufficient set of equations for the spherically symmetric 
Einstein-Klein-Gordon system in polar-areal coordinates (again, modulo initial values and bound- 
ary conditions) . In the next section we use a further simplified subset of these equations to compute 
solutions representing single boson stars. 



3.4 Constructing Boson Stars: The Static Ansatz 

A spherically symmetric, localized, time independent configuration of matter captures the simplest 
notion of a star. Provided that the matter is regular everywhere, we should expect such a config- 
uration to produce a gravitational field that is also spherically symmetric, time independent and 
globally regular. Moreover, if we are to be able to study the time evolution of these objects to any 
significant degree, then they should also be dynamically stable. For the case of a complex field 
it is not possible to construct such states on the basis of time independence of (j) itself. Indeed, 
Friedberg, Lee and Pang |107j demonstrated that in order for a boson star to be in a (minimal 
energy) ground state — a necessary condition for stability — (j) must have harmonic time dependence. 
Thus, we adopt the following ansatz for the complex scalar field 

0(t,r) = </)o(r)e-'"*, (3.41) 

where lo is assumed to be a non-negative real constant, and where, for the time being, (t, ?■) are 
a general set of coordinates for spherically symmetric, time dependent spacetimes as discussed in 
Sec. 13.11 As can be quickly verified by examination of the form of p.69|) . all components of the 
stress-tensor, T^i^, become time independent under this assumption. Consequently, the spacetime 
(i.e. the gravitational field) that is produced can also be expected to be time independent. 

Now, we note that the most generic sense of "time independence" in general relativity means 
that the spacetime has a timelike Killing vector field, in which case the spacetime is said to be 
stationary. Additionally, there is a more restrictive definition which requires that we be able to 
foliate the spacetime with hypersurfaces, E^, such that the Killing vector is everywhere orthogonal 
to these slices. In this case the spacetime is said to be static and it is straightforward to show 
that in coordinates (t,x*) adapted to the timelike symmetry, the metric components g^^, must be 
invariant under the "time reflection" symmetry, t ^ —t |120j . An example of a spacetime which is 
stationary but not static is provided by an axially symmetric star composed of self-gravitating fluid 
that is in rotation about some symmetry axis, and which thus has some net angular momentum. 
Although one can construct boson star models that have angular momentum, we will not consider 
them here, so the demand that spacetime be static is the appropriate choice. We observe that, 
assuming that t is adapted to the timelike symmetry — which we will hereafter require — (j3.4ip is 
compatible with this demand. We will therefore refer to it as the "static ansatz" . 

We now consider what the ansatz implies for the spherically symmetric 3-1-1 equations displayed 
in the two previous sections. First, the static requirement immediately implies that all of the metric 
components, g^^, must be time independent functions, and that in terms of the general 3 + 1 form 
of the line element (|3.3p . the shift vector component /3, must vanish: 



/3(t,r) = 0. 



(3.42) 
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We observe that this last condition is always satisfied in polar-areal coordinates, but that it needs 
to be imposed explicitly when working in maximal-isotropic coordinates. 

Since the specific metric functions a{t, r) and b{t, r) must have vanishing time derivatives, and 
we also have /? = 0, Eqs. p.9p and p.lOp then imply that the extrinsic curvature components also 
vanish: 

K% = K\ = . (3.43) 

As a consequence, for the case of a static solution, polar slicing is also maximal, and vice versa, so 
that to perform a coordinate transformation of a static spacetime from polar-areal coordinates to 
the maximal-areal system, we only have to transform the radial coordinate. 

Considering the scalar field variables, we note that p.4ip results in the following expressions 
for the time and radial derivatives of 6: 



4>'{t,r 

From these relations we then have 

m,r) = 

$(i,r) = </)^(r)e 

We also have 



—iuj—(j)o(r)e 
a 



= no(r)e- 



n*$ + n$* 



0, 



(3.44) 
(3.45) 

(3.46) 
(3.47) 

(3.48) 



which means that the momentum constraint (j3.15|) is satisfied identically. 

We now adopt polar-areal coordinates (t, R) (where we use t rather than T in view of the 
above observation that polar and maximal slicings are identical for static spacetimcs), and use the 
above results in the set of equations for the spherically symmetric EKG system that was given 
in the previous section. After some manipulation, the Hamiltonian constraint (|3.35p . the slicing 
condition (|3.37|) , and the evolution equation for the scalar field p.40p . become the following system 
of coupled ODEs: 



a'(i?) = 






a'iR) = 


a 




2 ' 


I R 




$0 






-( 


l + a^ - 



AirRa 



a'- 



4t:R 



R 



dU[(j)l) uP- 



00- 



(3.49) 

(3.50) 
(3.51) 
(3.52) 



Note that a prime now denotes ordinary differentiation and that the equation (/)g = $0 follows from 
the definition of <I>o in p.47p . We will subsequently refer to this set of equations as the polar-areal 
ODE system. 

We observe that this system contains terms such as a(l — a?)/R and ^o/R which, naively at 
least, appear to be singular at the origin, R = 0. In order to ensure regularity at the origin, certain 
conditions must be imposed on a{R), a(R) and 4)o{R). A thorough mathematical treatment of this 
subject defines a function (or, more generally, a component of a tensor) to be regular at the origin 
if, using Cartesian coordinates, {x^y,z), it has a convergent Taylor series in a neighbourhood of 
{x,y,z) = (0,0,0); i.e. at i? = [156) . For the case at hand (see |156[ 11571 1158| for additional 
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details) this requirement turns out to imply the following limiting forms for a{R), a(R) and 0o(i?): 
mil a{R) = ao + aai?^ + 0{R^) = 1 + aai?^ + 0{R^) , (3.53) 
lim a(R) ^ ao + a2R^ + OiR^) , (3.54) 
H^^MR) = 4>m + h2R^ + OiR^) . (3.55) 

Here, ap, 02, ao, 0^2, <^oo and 0o2 are constants, and the fact that ao = 1 follows from the demand 
that spacetime be locally flat at i? = 0. From p.55p we also have 

lim <I>o(i?) = 2002^ + 0{R^) . (3.56) 

_R-!.0 

Using p.53p - p.56p in (|3.49p - (|3.52p . it is straightforward to show that the system of ODEs is 
regular at i? = 0. 

Since p.49p - p.52p is a system of 4 ODEs, it would seem that we need 4 boundary conditions to 
generate a unique solution. However, there is a twist here which arises from the fact that the ODE 
system is actually to be treated as an eigenvalue problem, where lo is the eigenvalue. Specifically, 
for any choice of interaction potential C/(|(/)p), the solutions of p.49p - p.52p form a one-parameter 
family, where the central modulus of the scalar field, (?io(0), is a convenient choice for the family 
parameter. Since we want the solutions of the ODE system to represent stars, and thus to describe 
(essentially) localized distributions of matter, we must have 

lim (/)o(i?) =0. (3.57) 

For any given choice of 0o(O), and in conjunction with the other boundary conditions that we 
will enumerate below, solutions satisfying p.57p will only exist for a discrete set of values of 
UJ = uj{(j)o{0)) and, further, we will only be interested in the smallest such w, which will correspond 
to the lowest energy boson star for a given 4>o{0). In particular the function 4>o{R) for that choice of 
LO will have no zero-crossings (i.e. it will be "nodeless"): higher energy eigenstates will have 1, 2, . . . 
zero-crossings but, again, will be of no concern to us here. 

Bearing this in mind, and using (j3.53p - (|3.56p . the following constitute a sufficient set of bound- 
ary conditions for the polar-areal ODE system: 

a(0) = 1, (3.58) 

a(0) = ao, (3.59) 

MO) = M, (3.60) 

*o(0) = 0. (3.61) 

where (f>oo is freely specifiable, as is, at least momentarily, ao, but where it is to be understood 
that w = w((/'o) must be determined so that the asymptotic condition ()3.57p is also satisfied. 

We now consider the boundary condition ()3.59p more closely. We note that the eigenvalue w 
enters in the system p.49p - p.52p strictly in the combination iJ^ jo? . Combined with the fact that, 
apart from that term, the slicing equation p.50p is linear and homogeneous in a, this means that 
if we generate a solution of p.49p - p.52p subject to the boundary conditions (|3.58p ~ (|3.60p . then 
we can take 

w kuj, (3.62) 

a{R) ka{R), (3.63) 



where k is an arbitrary positive constant, and still have a solution. Moreover, since the profiles 
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a{R), ipoiR) and $o(-R) will be unchanged under the rescaling p.62p - p.63[) . any solutions obtained 
via such transformations will correspond to the same boson star. This reflects that fact that when 
we adopt a time-slicing condition such as polar slicing or maximal slicing, we can always freely 
rescale a(t, r) via a(t, r) — > /(<) a(t, r) where f{t) is some arbitrary function of the time coordinate, 
and still have a solution of Einstein's equations. The physical interpretation of this rescaling is 
that we have complete freedom as to how we are to assign, or reassign, specific labels to the family 
of hypersurfaces, Et, that foliate the spacetime. 

In practice it is a common and sensible choice to perform this labelling so that coordinate time 
and proper time coincide for observers located at i? = oo, and who are at rest in the slices. This 
means that we want 

lim a{R) = 1 . (3.64) 

As was the case in Sec. 12.4.51 we are now confronted with the issue of dealing with a boundary 
condition naturally expressed at infinity, which is problematic if our numerical computations can 
only extend to finite values of R. Fortunately, in this case there is a simple resolution of this issue 
which is based on the uniqueness of the solution of Einstein's equations in spherically symmetry 
when no matter is present. This is known as BirkhofF's theorem, and the interested reader can 
consult Hawking and Ellis |159| . for example, for a proof. Here we assert that the solutions of p.49[) - 
p.52|) — i.e. the boson star solutions — are characterized by scalar field profiles, (j)Q{R), which fall 
off exponentially after some characteristic radius. Thus, to well within the numerical accuracy to 
which we work in this thesis, the spacetime can be considered to be vacuum for R > i?max, where 
^max is the limit of integration of the system p.49p - p.52p . 

Therefore, in terms of the static form of the 3 + 1 element we adopt for polar-areal coordinates: 

ds^ = -a{Rfdt^ + a{RfdR^ + R'^dfl'^, (3.65) 

Birkhoff's theorem tells us that for R > i?,„ax we must be able to identify p.65|) with the familiar 
Schwarzschild form: 

ds^ = -(^l-'^^dt^ +1^1-'^^ 'di?2 + i?W, (3.66) 

where the constant M is the total mass of the spacetime (i.e. M = Madm)- Note that since 

2M] 



lim 



1 

R 



1 , (3.67) 



the constant time surfaces defined by (|3.66p do satisfy the condition that proper and coordinate 
time coincide at infinity. Comparing p.65p and p.66p we thus have the following limits for the 
metric functions a{R) and a{R): 

lim a^{R) = fl- ^) , (3-68) 



fi^cx) V R 



2M^ ^ 



lim a'iR) = 1 - ^ , (3.69) 
which implies that we should choose the rescaling defined by p.62p and (|3.63l) so that 

a(i?max) = ,p . . (3.70) 
The correspondence a{R) (1 — 2M/R)~^, suggested by p.69|) . also motivates the definition 
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of a useful diagnostic function, the so-called mass aspect function: 

It can be shown that A/(i?) defined in this way does measure the gravitating mass contained within 
an i? = const, sphere, so for i? — >■ oo it must limit to the total ADM mass: 

lim M{R) = M = Madm- (3.72) 

To summarize, to determine a specific boson star solution we perform the following steps: 

1. We choose a particular value, 00 (0), for the central modulus of the scalar field. This consti- 
tutes the specification of the boundary condition p.60p . 

2. Using this choice and the additional boundary conditions 

a(0) = 1, (3.73) 
a(0) = 1, (3.74) 
$o(0) = 0, (3.75) 

we solve the polar- areal ODE system p.49p ~ p.52p on the interval < ^ < -Rmax by de- 
termining the eigenvalue w = a;(0o(O)) such that 0(i?max) ^ 0. Since a{R) and lu will be 
rescaled, we choose a(0) = 1 arbitrarily and for convenience. 

3. Once the solution has been determined, we rescale a{R) and lo using 

uj kuj, (3.76) 

a(i?) ^ ka{R), (3.77) 

where k is chosen so that p.70p is satisfied. 

The only remaining technical issue to be discussed is how we determine the eigenvalue lj in 
step 2 of above procedure. We do this with a straightforward shooting method |160j . This means 
that for any specified value of 0o(O), we must first determine an initial bracket [a;_,a;+] satisfying 
a;_ < cj < Solutions computed using uj- and uj^ will display distinct behaviours as i? — >■ i?max- 
Specifically, we find integration with cj = a;_ results in (t>o{R) — >■ oo as i? — ^ -Rmax, whereas for 
UJ = the integration leads to (poiR) — > — oo. Once the initial bracket has been found, we use 
a bisection method |160] to compute increasingly accurate estimates of w, where after each step 
in the bisection, the appropriate end point of the interval, [a;_ , w+J , is replaced with the current 
estimate (a;_ +uj+)/2. 

The solution method that we have just described was coded in a FORTRAN subroutine called 
bsidpa that takes care of the integration of the polar areal ODE system, the shooting process per 
se and the rescaling of the lapse function. The routine is documented in App. |X]and we note that 
we have made the code available to others who might find use for it. 

Once we have computed a boson star solution in polar-areal coordinates, we can transform the 
solution to maximal-isotropic coordinates, which are compatible with the coordinates we use in 
our 3D evolution code. As mentioned above, for static solutions the time coordinates in the two 
systems are identical, so the transformation only involves the radial coordinates. As shown in full 
detail in App. D of Lai's PhD thesis |128j . this transformation can be made by solving the ODE: 
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subject to the boundary condition: 



1 + y/Ey R 
a 



(3.79) 



Here r and R are the isotropic and areal radial coordinates, respectively, and a = a{R) is the 
metric function that appears in the static, areal form of the 3-dimensional spherically symmetric 
line element 

(3)ds2 = o^dR"^ + R^d^f . (3.80) 



Once R{r) has been determined, the conformal factor ^j{r) is easily computed from 

■0(r) 

with 



/ R 
r{R) 



as follows from an application of I'Hopital's rule. 



R=o 



(3.81) 



(3.82) 



3.5 Boson Stars: Some Basic Properties 

Fig. 13.11 shows four separate solutions of the polar-areal ODE system, computed for central scalar 
field values (^o(O) = 0.01, 0.03, 0.05 and 0.07, with a potential U{\(j)\^) having only a mass term 

U{\(t>\'') = m202 . (3.83) 

Here m is the mass parameter for the field, which, by our choice of units, satisfies m = 1. Each 
plotted solution represents a distinct boson star. 

We note that as ^o(O) increases, the stars become increasingly compact — meaning that the star's 
size (defined, for example, as the radius, R, at which M[R) given by (|3.7ip is 99% of Madm) — 
decreases as 00(0) increases. We also observe that the central values of the lapse, a(i?), decrease 
as 00 increases, while the R = values of the conformal factor, ip{R), increase. All of these trends 
are indicative of the fact that the gravitational self-interaction of the scalar field strengthens as 
4>o assumes larger values. Further, although it is not immediately apparent from the plots, we 
assert that all of these solutions satisfy the appropriate asymptotic boundary conditions, namely 
that limi^^oo tp{R) = linii?.^oo a{R) = 1. The figure also shows graphs of the mass aspect function, 
M{R), for the stars, from which one can see a convergence of Af (i?) — >■ Madm, with the convergence 
being more rapid for the stars defined by larger 00 (0). Finally we note that all four of these stars 
are dynamically stable against radial perturbations, although, as we will discuss shortly, this is not 
the case for all solutions in the one-parameter family. 

Fig. 13.21 displays plots of the ADM mass, Madm, as a function of the central scalar field 
value 00 (0) (left panel), as well as a function of two estimates of the stellar radius, Rgg and 
i?95 (right panel). Rgg and Rg^ arc defined to be the radii for which M{R) — 0.99 Madm and 
M{R) — 0.95 Madm, respectively: both of these definitions are commonly used in the boson star 
literature. We note that M(0o(O)) has an absolute maximum at 0o(O) = 0^ w 0.08 where it attains 
a value M^ax = M (0q) sa 0.633 (again, recall that we work in units in which c = G = to = 1). In 
both plots the solid triangles label stars computed with 0o(O) = 0.01, 0.02, 0.03, 0.04, 0.05, 0.06 
and 0.07 (left to right along the M(0o(O)) curve, and right to left along the plots of AfADM(-R99) 
and Madm (^95))- 

It is possible to show from perturbation analyses, as well as from evolution of the full equations 
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of motion |110[ 11161 11611 [128] that stars satisfying (j)o{0) < (pQ are dynamically stable against radial 
perturbations, while those defined by (/>o(0) > (f>Q are dynamically unstable. The sequences of 
configurations defined by <?!>o(0) < and (po^O) > (f)Q are thus known as the stable and unstable 
branches, respectively. Moreover, with reference to the plot of M((/)o(0)) shown in the left panel of 
Fig. 13.21 it can be shown that every extremum in the plot corresponds to an additional pcrturbative 
mode becoming unstable. We emphasize that all of the results concerning boson star evolutions 
that are reported in this thesis (Chap. [5]) used stars from the stable branch. 

The existence of a maximum mass for our one-parameter family of boson stars, and the fact 
that there is a change of stability at (j)o{0) = (j)Q, is completely analogous to the Chandrasekhar 
limits for spherically symmetric white dwarfs and neutron stars |162| . In both of those instances, 
above some mass limit there is no stable static configuration and the star is prone to gravitational 
collapse (to a neutron star in the case of a white dwarf, and, it is widely believed, to a black hole 
in the case of a neutron star). We note, however, that for boson stars there is no degeneracy 
pressure, as there is for the fermionic white dwarfs and neutron stars. In the bosonic case, the 
effective pressure support that counteracts gravity can be viewed heuristically as coming from the 
uncertainty principle if the system is studied semiclassically, or from the dispersive nature of the 
Klein-Gordon wave equation when the system is studied classically, as it is in our current work. 
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Figure 3.1: Typical Boson Star Solutions. These plots shows the values of <f)Q{R),ijj{R), a{R) 
and M{R) as a function of the areal coordinate, i?, for boson stars defined by 00(0) = 0.01, 0.03, 
0.05 and 0.07. As discussed in more detail in the text, all of these configurations are dynamically 
stable against radial perturbations. Note that as (/)o(0) increases the stars become smaller (more 
compact). 
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Figure 3.2: This figure shows the ADM mass, A/adm, as a function of the central scalar field 
value, 00 (0) (left panel), as well as a function of two estimates, Rgg, and i?95, of the stellar 
radius (right panel). Rgg and Rg5 are defined to be the radii for which M{R) = 0.99AfADM 
and M{R) = 0.95A'/adm, respectively. The Madm(0o(O)) curve has an absolute maximum at 
<j>o{0) = 00 ~ ^-^^ with A/niax = M {(Pq) « 0.633. As discussed in the text, the value (j>Q (shown as 
a dashed line in the plot) signals a change in dynamical stability of the solutions: for 0o(O) < ^o 
the stars are stable to radial perturbations, while those with 0o(O) > 4>q are unstable. The solid 
triangles label specific models computed with 0o(O) = 0.01, 0.02, 0.03, 0.04, 0.05, 0.06, and 0.07— 
these are all on the stable branch. The inset in the left plot highlights the form of Madm(0o(O)) 
for 0.01 < 00(0) < 0.10. Note that for the curves in the right plot, the triangles corresponding to 
0o(O) = 0.01,0.02, . . .0.07 appear right-to-left along the plots — i.e. as noted in the caption of the 
previous figure, as well as in the text, the stellar radius decreases with increasing 0o(O). 
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3.6 Boson Stars: Representation in Cartesian Coordinates 



In Sec. 13.41 we summarized the procedure we use to compute boson star data in areal coordinates, 
as well as the subsequent transformation of the data to isotropic coordinates. As we noted there, 
the coordinate transformation is needed to provide initial conditions in a form that is compatible 
with our 3D code, which assumes conformal flatness of the 3-metric. In addition, since the code is 
3D, and based in Cartesian coordinates, (a;, y, z), we must also perform some interpolation of the 
field values. 

Let us first consider the case where we wish to evolve a single star with some given central field 
value, 4>o{0). We start by solving the static polar-areal ODE system, which yields the fimctions 
a{Rj), a{Rj) and (j)o{Rj), as well as the eigenvalue for the solution, uj ~ aj(0o(O)). Here Rj, 
J = 1,2,..., Nj are the Nj discrete values of the areal radius at which we choose to compute 
(store) the numerical solution of the ODEs. We then determine the coordinate transformation to 
isotropic coordinates, rj = r{Rj), by solving p.78|) with the boundary conditions p.79p . Once 
this is done, values for the conformal factor, ip{rj), are determined from p.81|) and 13.82]) . Next, 
since the time coordinates are identical in the polar-areal and maximal-isotropic systems, the lapse 
function transforms as a scalar, and we have a{rj) — a{r{Rj)). Finally, the matter field, (j)Q, is a 
scalar, so we also have (j^oifj) = cl)o{r{Rj)). 

Having computed tpir), a{r) and Q;(r), it is a simple matter to interpolate these functions onto 
the discrete domain which our 3D code uses to solve the PDEs derived in Chap. [2j Without going 
into the full details (these are given in the next chapter), the finite difference grid points have 
coordinates {xi^yj, Zk) where i = 1, . . . , n^,, j = 1, . . . , Uy, and fc = 1, . . . , n^, so that the grid has 
dimensions y- Uy x n^. We choose a point having coordinates (ccq, yoi -zq) that lies within the 
computational domain and at which we will "centre" the star. Then for all i, j and k we define 



rijk = \J - xqY + {yj - yo)^ + (-Zfc - zqY (3.84) 

and then set 

a{xi,yj,Zk) = a(ryfc), (3.85) 

ip{xi,y.j,Zk) = ipinjk) , (3.86) 

(t)o{xi,yj,Zk) = (l>oiri]k) ■ (3.87) 

Here, the values a(ryfc), ip{rijk) and (f>o{rijk) are computed using quadratic Lagrange polynomial 
interpolation |160| in the values a{rj), '4'{rj) and 4>o(rj), respectively. 

Next, using the values 4)o{xi, yj, 2^), and the scalar ansatz p.4ip evaluated at t = 0, we initialize 
the real and imaginary components of the scalar field by 

0i(xi,?/j,Zfc) = (l)o{xi,yj,Zk) , (3.88) 
(j)2{.Xi,yj,Zk) = 0. (3.89) 

In addition, the field conjugate momenta can be computed from (|2.173|) . again, by using the 
static ansatz (|3.4ip for the complex scalar field, and the fact that the shift vector components 
vanish in the static case. In terms of values defined above, and calculated using interpolation of 
the ODE solution, we have 

Tli{xi,yj,Zk) = 4io{xi,yj,Zk)sm{ujt) , (3.90) 

a[Xi,yj,Zk) 

Tl2{xi,yj,Zk) = (j>o{xi,yj,Zk)cos{ujt) . (3.91) 

(^[Xi , yj , Zk ) 
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Since t = 0, the above expressions further reduce to: 

ni(x„j/j,Zfe) = 0, (3.92) 

Il2{xi,yj,Zk) = -UJ — " ^(l)o{xi,yj,Zk) ■ (3.93) 

a(a;i,2/j,2fe) 

For the case of a binary system, the above process is performed for both stars, and we then 
superimpose the solutions as foUows: 



a{xi,y-i,Zk) 


— Q;^ ^ (xi , yj , 


2fc) ^ 


- Q;'^^'(a;i,yj,Zfc) - 


~1, 


(3.94) 


ip{x^,y,Zk) 




^fe) +ip^^\xi,yj,Zk) 


-1, 


(3.95) 


(f)i{x^,y.j,Zk) 


= (f'[^\xi,y3, 


Zk) -\ 


- <l)?\x,,yj,Zk) , 




(3.96) 


(f>2{xi,yj,Zk) 


^ 4^'(a;,;,yj-, 


Zk) 4 


- (j)''2\x,,yj,Zk) , 




(3.97) 


Ili{x^,yj,Zk) 


= rij [xi,yj. 


,2:fc) - 


Knf'(a;j,yj,Zfc) 




(3.98) 


Il2{xi,yj,Zk) 


— ^ (Xi , J/j 


, Zk) - 


^n^^'(a;j,yj,Zfc) 




(3.99) 



Here the superscripts (1) and (2) refer to the interpolated solutions for the first and second star, 
respectively. In practice, when we set up data for a binary we try to ensure that the two stars do 
not overlap significantly. In this case, we can expect that superposition of the individual lapse and 
conformal functions, as defined above, provides values of a{xi, yj, Zk) and '4){xi, y, Zk) which satisfy 
the slicing and Hamiltonian constraint equations to some degree of precision. However, apart from 
considerations of the extent that the initial setup does represent two distinct and separated stars, 
the issues of whether or not there is substantial overlap of (/)q^^ and (/>q^\ and whether or not 
superposition of the metric functions approximately holds, are not very important. In particular, 
the scalar field variables are, in principle, freely specifiable, so an overlapping configuration provides 
initial data that is no less mathematically valid than one in which there is negligible overlap. 
Furthermore, once we have fixed the scalar field values at t ~ 0, we always solve the elliptic 
equations PDEs for the metric functions to determine their initial values. Ultimately then, the 
values obtained using the superposition formulae (|3.94p and (|3.95p are only used to provide initial 
estimates for the iterative multigrid method that we use for the elliptic system (see Chap. 14]). 



3.7 Boson Stars: Applying Approximate Lorentz Boosts 

The procedure described in the previous section allows us to initialize our 3D dynamical code with 
data representing one or two stars that are initially at rest in the {x,y,z) coordinate system. In 
order to simulate scenarios such as a binary system in which the stars are in mutual orbit about 
one another, or, in general, where there is to be any motion of one or both stars at t = 0, the scalar 
field initial conditions must be adjusted to provide the stars with initial velocities. 

In this section we describe how this is accomplished through the use of Lorentz transformations, 
applied to both the scalar field and metric variables. We emphasize at this point that the algorithm 
that we describe below does not, in general, provide initial conditions (even for a single star) that 
correspond to a pure boost. This is a result of an incompatibility between our straightforward 
approach to applying the Lorentz transformation and the requirement that our spatial metric be 
conformally flat. However, at least at this juncture in our research, we do not consider this to be a 
major shortcoming. What is most important is that we be able to produce initial conditions such 
that the stars propagate through the solution domain, retaining their overall structure, at least 
approximately, as they move. As will be seen in Chap. [5l our current method certainly succeeds in 
this respect. 
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In the calculations that we report in this thesis (again, in Chap. [5]), we have restricted attention 
to configurations in which one or two boson stars are boosted in the x direction. We therefore 
adopt an abbreviated notation which suppresses the functional dependence of the unknowns on y 
and z. Thus, for example, (t>{t,x) is shorthand for (j){t, x,y, z). We let {t,x) be the coordinates in 
the "lab" frame (i.e. the coordinate system in which we ultimately solve the PDEs), and denote 
the coordinates of the rest frame of any boosted star by {t',x'). 

We now focus attention on the case of a single boson star. Then if the star is boosted with 
speed w > in the positive x direction, and keeping in mind that we have adopted units in which 
the speed of light, c, is unity, the Lorentz transformation is 

X = j{x' + vt'), (3.100) 
t = ^{t' + vx'), (3.101) 

where 7 is the usual Lorentz factor defined by 7 = (1 — u^)~^/^. The inverse Lorentz transformation 
is: 

x' = j{x~vt), (3.102) 
t' ^ j{t-vx). (3.103) 

Now, from the static ansatz p.4ip we have the following equations for the real and imaginary 
components of the scalar field, 0i and 02, respectively: 

4>[it\x') = Mx') cos {cot'), (3.104) 
4>2it',x') = -Mx')sm{ujt'). (3.105) 

The time derivatives of these quantities, which we denote by 'n[{t',x') and li2{t',x'), respectively, 
arc then given by 

U\it',x') s Mgl^^_^^„(.^')sin(c.i'), (3.106) 

n^(t',a;') = ^^2!^-!^ ^-ujMx') cos {Lut'). (3.107) 

As a scalar field is invariant under any coordinate transformation, the values of 0i and (j)2 are 
determined by 

01 (i, a;) EE (t>[{t',x') ^ (l)o{j{x~vt))cos[ujj{t-vx)], (3.108) 
02(t,a;) = (f>'2it',x') = -(f>o{-f{x ~vt))sm[u!-f(t-vx)], (3.109) 

where we have used the inverse Lorentz transformations given by (|3.102p and (|3.103p . At the initial 
time, t = 0, we thus have 

(j)i{0,x) = (j)o{jx) cos{uj'-fvx), (3.110) 
02(0,0;) = (j)Q{'yx) sin (uj^vx). (3.111) 

We remind the reader that we are suppressing the y and z dependence of the unknowns, so that 
00(72;) = 00(72;, y, z). This value can be computed from the ODE solution, 0o(r(i?j)), via in- 
terpolation to r^x = — 2;o)^ + (y ~ 2/0)^ + (-2^ ~ -2:0)^, where (a;o,yoi^o) are the lab frame 
coordinates of the point at which the star is centred. 

The time derivatives, ZI'i and 112, are also scalar fields and thus invariant under coordinate 
transformations as well. However, in order to write down expressions for them in the lab frame, 
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we need to go through the Lorentz transformation in more detaiL We first note that the chain rule 
can be used to express the time derivatives in the star frame's in terms of derivatives in the lab 
frame: 



dx d(j)i 



dt 



n,(M) ^ ni(^'>-') = #(^'^-') = ii(M) = ^(M)g,4-:^(M)^, (3.112) 



dx d(j)2 



dt 



n.(M) ^ ni(i',.0= 3^(*'..') = ^(M) = ^(M)g; + ^(M)|^. (3.113) 

The spatial derivatives appearing in the above expressions above can be calculated using equa- 
tions ([3lM| and (f3J09ll : 



-^{jix — vt)) cos [(-L>"f(t — vx)] 
ox 

+ujjv(l)o{j{x — vt)) sin [ujj{t — vx)] 



dx 



{t,x) 



dx 



(7(x — vt)) sin [uj^(t — vx)] 



+uj^v(f>o{'j{x — vt)) cos [i^"f{t — vx)] 



Additionally, we have 



dt 



(3.114) 



(3.115) 



(3.116) 



c' —^{x—vt) 



so that 



d^ 
dt 



kt,x) 



dcjjo 
dt 



(7(2: — vt)) COS [(-L>"f{t — vx)] — uj"f(f)Q{'y{x — vt)) sin [w7(i — vx)] 



090, /^ 
-7-^(^) 



cos [ijJ^{t — vx)] 

x' —■y{x—vt) 

-iu^4>o{l{x ~ vt)) sin [w7(t — vx)] , 



(3.117) 



and 



902 

dt 



it,x) 



d(k 
' dt 



(7(x — vt)) sin [w7(t — vx)] — ujj(j>Q{j{x — vt)) cos [u!j{t — vx)] 



7- Q^i- ) 



sin [u!j{t — vx)] 

x'—^{x — vt) 

ujj(f)Q{'j{x — vt)) cos [ujj{t — vx)] . 



(3.118) 



Using (jXTTil) . (|XTT51) . (|XTT7|) and (jXTTS)) in ((XTT^ and (|XTn|) yields 



ni(<,x) 



112 (t, a;) 



—ui(j)Q{'-^{x — ?ji)) sin [w7(t — wx)] 

r 900, ,x 900, , 
^~dx'^^'~~d^^^^^~ 



cos [w7(t — vx)] 



sin [w7(t — wx)] 



(3.119) 



— a;0o(7(a; — vt)) cos [ijO"f[t — vx)], (3.120) 
where the derivatives with respect to x' are to be evaluated at x' — j{x — vt). At i = 0, the solution 
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in the lab frame is then 



ni(o,x) 



n2(0,a:) 



7W 



7W 



dx 



dx 



x'—'^x 



cos(uj^vx) + uj4>{){'~^x) sin(a;7ux), (3.121) 



sin(u;7wa;) — u]4>o(^x) cos{uj^vx) . (3.122) 



In order to completely determine the initial data — if only to provide good initial estimates for 
the elliptic solver dX t = — we also need to consider the effect of the Lorentz boost on the metric 
components in the lab frame. In the rest frame of the star we have 



I ~a'^(t\x') \ 

V''^(t',a;') 

V^'^(t',x') 

\ '^'^{t'^x') ) 



(3.123) 



We now adopt another notation, in which we use a tilde on lab-frame metric components that 
have had Lorentz transformations applied to them. This distingushcs them from the actual lab- 
frame metric variables that appear in the our CFA model, in view of our above observation that 
the simple-minded boost procedure we use is incompatible with conformal flatness. The metric 
components in the lab frame are then given by 



_ , , ^x'^' dx'" , , , 
gxs{t,x)= g^^ gp.ft:^). 

where, from the general 3 + 1 form we also have 

-&^it,x)+j'ip,p, h{t,x) 



g\5{t,x) = 
The Lorentz boost is given by 



fii{t,x) 



lik{t,x) 



(3.124) 



(3.125) 



dx^ 



( ^ \ 

—71; 7 

10 

\ 1 y 



(3.126) 



(3.127) 



(3.128) 



Thus, for example, the component of the lab frame metric is given by 

go. = /3.(t, x) = A^o A'', g'^, ^ (A% g[,, + A^o g',,) k\ , 

which yields 

h{t,x)=^\[a'\t\x')-^'\t',x')\ . 

Similarly, it is possible to show that the spatial metric in the lab frame is related to the metric in 
the star's rest frame by: 

/ 72 [^'4(t',x')-i'2a'2(t',x')] \ 

1^3 = i^'\t',x') , (3.129) 

V i}'\t',x') J 

As promised, this lab frame 3-metric is not conformally flat. However, as the tilde notation em- 
phasises, it is also not the spatial metric with which the 3D code is eventually initialized, and we 
emphasize that there is no inconsistency being introduced by the current method for determining 
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initial data. In this regard, the metric (j3.129p is perhaps best viewed as an intermediary which aids 
in the computation of other quantities (such as the scalar field momenta, Hi and 112) to produce 
data approximately describing a boosted star. A similar comment applies to the other lab frame 
geometric quantities — namely the lapse, a, and the non- vanishing shift component — that are 
defined in this stage of the algorithm. They too should be viewed as "provisional" values that 
are used primarily to set the scalar field quantities at f = so that something close to a boosted 
boson star results. On the other hand, the true initial values for the geometric quantities are al- 
ways determined — once the scalar field variables are given — by solving the governing set of elliptic 
PDEs (|2.181|) ~ ()2.185p . in which the scalar field quantities play the role of sources. 
Keeping this in mind then, we continue by using (|3.129p and p.l28p to find 



v[a'^{t',x')-^'\t',x')] 
[ip"^{t\x') -v'^a'^(t',x')] 



(3.130) 



However, since the metric components in the star frame are all time independent, the above equation 
becomes 

v[a'^{x')-^p"^{x')] 
" [i,'\x')-v^a'^x')] ' ^^-^^^^ 
and restricting to the initial time t = 0, we have 

V [a'^(7a;) — ij)"^{'^x)\ 



r(o,x) 



(3.132) 



Using a derivation analogous to that just used to determine /3^(0,a;), we can show that the 
initial values of the lapse in the lab frame are given by 



(5(0, x) 

while the conformal factor is simply 



a' {'^x)ip''^(jx) 



■(/'(O, x) = ip'^-jx) . 



(3.133) 



(3.134) 



Finally, using ([XTTOll . ([XTTTI) . (f3J2T|) . (f3J22| . (f3J32| . ([3333]) and ([3l^ . the initial values 
for the scalar field conjugate momenta are given by 



Hi (0,2;) 

n2(o,x) 



(52(0, x) 
(52(0, x) 



ni(0,a;)-r(0,x)^(0,.T) 
n2(0,a;)-r(0,x)^(0,.T) 



(3.135) 
(3.136) 



To summarize, equations p.llOp . p.lll|) . (|3.135p and p.l36p provide initial values for the 
scalar field components and their conjugate momenta in the lab reference frame. On the other 
hand, equations p.l32p . (|3.133p and (|3.134p are used to provide initial estimates for the values of 
a and iJj that are ultimately computed by solving the elliptic equations at i = 0. However, 
they are also used to construct the t = values of the scalar field conjugate momenta as given by 
equations (|3.135p and p.l36p . 

As was the case in the previous section, for configurations involving two stars, the above calcu- 
lations are performed for each star individually. We then set values for the fields using the following 
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superposition formulae: 





— 01 lL),xj-f 




/'Q 1 Q7^ 
^O.iO 1 ) 


(/)2(0,a;) 




-4^^(0,x), 


(3.138) 


ni(o,x) 


= ll['\0,x)^ 


Fnp'(o,x), 


(3.139) 


n2(o,x) 


= n(^^(o,x)- 


^n^^\o,x), 


(3.140) 


r(o,x) 






(3.141) 


(5(0, x) 


= a(^)(0,a;)4 


-a(2)(o,x) - 1, 


(3.142) 


4>{o,x) 


= V;(i'(0,a;) +^-'^^0,2;) - 1. 


(3.143) 



Here the superscripts (1) and (2) again refer to the solutions computed for the two individual stars. 
For the scalar field variables, (pi, (j)2, Hi and 112, p.l37p - p.l40p are used to set the actual initial 
values. Again, however, p.l4ip - p.l43p are used only to provide initial estimates for the multigrid 
elliptic solver which computes the actual t = values of a and ip. 

We end this section by noting that it is probably possible to refine the algorithm described 
here — by iterating the steps of computing the conjugate momenta using expressions such as p.l35p 
and p.l36p . and of solving the elliptic equations for the metric variables — in order to determine 
initial data that more closely approximates a purely boosted boson star. This however, is another 
matter that will require further investigation: as stated earlier, for the purposes of the computations 
described in Chap. [SJ the procedure detailed here appears adequate. 



Chapter 4 

Numerical Techniques 
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This chapter discusses the key numerical techniques that were used to find approximate solutions of 
the partial differential equations (PDEs) that comprise our model. Furthermore, the methods that 
were employed to assess the basic correctness of the numerical implementation and the accuracy 
of the generated solutions are presented. Although some issues are covered here in more detail 
than others, for completeness we have tried to describe all of the main numerical approaches and 
algorithms that factored into the construction of our code. In this regard, however, we note that 
we have relegated some of the more basic and well-known concepts and techniques to Appendices 
|B]and[D] Portions of this chapter closely follow the presentation in jl63j and the interested reader 
is referred to those notes (as well as references contained therein) for examples, as well as more 
detailed discussions of some of the approaches we have used. In addition, for the uninitiated, the 
venerable text due to Burden and Faires |160| provides an excellent general introduction to the 
field of numerical analysis. 

A crucial step in the construction of numerical solutions of PDEs is the choice of an appropriate 
discretization approach. The one adopted here is finite difference approximation (FDA), and, fun- 
damentally, involves replacement of the differential operators appearing in the PDEs with suitable 
finite difference operators. While the differential operators are applied to functions defined on a 
particular subdomain, fl, of the continuum — which in our case is R x MP — in practice, the discrete 
operators are applied to functions defined on a discrete set of points, fl^, often referred to as a 
grid, or mesh.o Sec. 14. II introduces the main concepts we use in the finite difference discretization 
of our PDEs. Based on the FDA concepts and assumptions discussed in Sec. 14.11 Sec. 14.21 then 
continues with a description of the analysis tools that were used to evaluate the fidelity of out 
numerical solutions, as presented in detail in Chap. [S] 

As discussed previously, our model consists of a coupled system of 4 hyperbolic (first order in 
time) and 5 elliptic (second order in space) PDEs. Since the equations belong to two different 
classes, it is natural to expect that different techniques will be needed to efficiently obtain the full 
numerical solutions. In brief, we treat the hyperbolic equations using a second order (in the mesh 
spacing, h) Crank-Nicholson approximation that is solved iteratively, while the elliptic equations, 
also discretized to Oih?), are solved using a multigrid technique. Here we note that, as illustrated 
by the pseudo-code of the overall flow of our code in Fig. 15.11 (Sec. 15. 1( ). at each stage of the Crank- 
Nicholson iteration that performs the basic time-step-advance of the hyperbolics, we re-solve the 
elliptics. A review of basic relaxation techniques, including the point-wise Newton-Gauss-Seidel 
method used in the Crank-Nicholson iteration is given App. [DJ while the multigrid method is 
discussed in some detail in Sec. 14.31 

4.1 Discretization of Partial Differential Equations: Finite 
Difference Approximation 

Two fundamental assumptions underlying a successful finite difference approximation (FDA) of a 
set of PDEs are: 



^We note, however, that it can be useful to view the discrete operators as acting on functions defined on the 
continuum, particularly when considering error analysis of the type discussed, for example, in l4.2.2l 
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1. Given suitable boundary and/or initial conditions, there is a unique solution of the system 
of PDEs to be discretized. 

2. This unique solution is smooth. 

As argued in Chap. [21 any solution of the coupled hyperbolic-elliptic system of equations constitut- 
ing the initial-boundary value problem to be solved in this thesis is expected to be smooth provided 
the prescribed initial data is smooth. Also, although no general proof exists, there is no reason a 
priori to expect solutions of our system (again given appropriate initial and boundary conditions) 
not to be unique. Therefore the choice of finite difference approximation as the discretization 
approach becomes a natural one for the problem at hand. 

The first step in the discretization of PDEs using finite differencing involves the definition of a 
grid (or mesh) over the solution domain Q, C M" . (Wc note that for the time being we will restrict 
our attention to discretization of the spatial domain, so that £7 is a spatial volume: we will return 
to the issue of temporal discretization below.) Loosely speaking, the grid is defined as a discrete 
set of points from the continuum domain that satisfy some criteria. These criteria can be as diverse 
as the physical problems that are governed by systems of PDEs. In general, though, the choice of 
grid points is governed by the geometry of the domain, (and hence the geometries of the domain 
boundaries), as well as the coordinate system in which the PDEs are expressed. For domains with 
very complex structure, for example, it is common to use so-called unstructured grids that adapt 
themselves to the geometry. Grids used for simulations of car crashes or airplane aerodynamics 
are frequently of this kind. Gonsidering another example, a physical problem with approximate 
spherical symmetry will often be formulated in a coordinate system adapted to the symmetry, and 
it will therefore be natural to use a curvilinear grid which reflects that symmetry. For the case 
of the differential equations defining our model, the coordinate system in which the equations are 
written is the familiar Gartesian (or rectangular) one. Thus it is only logical that the grid adopted 
in the discretization also be Gartesian. In addition, the simplicity of our solution domain, combined 
with the fact that the objects that wc will be studying will 1) tend to have comparable extents in 
each of the three coordinate directions, and 2) tend to move through the computational domain, 
suggests the use of the simplest grid possible: a uniform Gartesian grid. For our 3D spatial domain 

C K."^ this consists of an ordered set of points {xi,yj,Zk) = il'* C such that the separation 
between adjacent points in any of the coordinate directions is some constant, h, known as the grid 
(or mesh) spacing, or discretization scale. Specifically we have 

Xi+i -Xi^h, y.j+i - yj = h, Zk+i - Zk = h . (4.1) 

For any FDA the various grid spacings that may appear in the definition of the mesh are the 
fundamental control parameters for the approximation: in particular one hopes to recover the 
continuum solution in the limit that all of the grid spacings tend to 0. We emphasize that in 
this thesis we will exclusively use discretizations (including that of the time variable) that are 
characterized by the single discretization scale, h. 

The specific grid points comprising our uniform Gartesian mesh are readily defined from the 
coordinate ranges of the continuum domain. Thus, for 

(4.2) 
(4.3) 
(4.4) 
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the grid points are 



such that 





— -^min ^" 


ij ft, 


i = 1 . . .Tlx, 




Vj 


= 2/min + {j — 


ij ft, 


j = I . . . Uy, 


(A P.^ 


Zk 






fc = 1 . . . n^. 


(4.7) 




X\ — ^min 


and 




(4.8) 




2/1 = ymin 


and 


Vriy ^maxj 


(4.9) 




Zl — ^min 


and 




(4.10) 



There are therefore Ng ~ x Uy x points in the mesh, and we further note that there is an 
imphcit assumption that each of the ranges Xmax — a^min, 2/max — 2/min and z,nax — -Zmin is evenly 
divisible by h. 

Once the grid fl^ has been defined, we can introduce grid functions w'' : fi'' M on it. For the 
3D Cartesian grid defined above, the grid function is simply equivalent to the Ng values Uijk 
defined on the grid points (x^, yj, Zk) — i.e. Uijk denotes a discrete approximation to the continuum 
value u{xi,yj,Zk). We also observe that the way that we have defined our discrete unknowns 
is often called a vertex centred approach, since the unknowns are defined at the grid points per 
se, instead of, for example, at the centre of a cubic cell composed of 8 neighbouring grid points. 
Discretizations based on this latter approach, and which have frequently been used in numerical 
relativity — especially in studies involving sets of conservation laws such as hydrodynamics — are 
known as cell centred. Finally, Fig. 14.11 illustrates a sample 3D rectangular grid of the type used 
throughout this thesis. 

As already mentioned in the introductory section of this chapter, the derivative operators ap- 
pearing in the system of PDEs governing our model are approximated by finite difference operators. 
It is conventional to characterize any such operator by its difference stencil or difference star. This 
involves specification of the set of neighbouring grid points that appear in the definition of the 
difference operator. For example, if a real function of one variable, u{x), is restricted to its values, 
Ui, defined on a uniform ID grid, il^ = {xi \ Xi+i — Xi = h}, then the second derivative of u at 
X —— Xi can be approximated with the following combination of grid function values: 

^.(.)^ "'-^'';; + "'^\ (4.11) 

The stencil of this finite difference operator (see Fig. 14. 2p has an overall multiplicative factor, 
l/Ii^, and is weighted by the ordered set of integers (1, —2, 1) corresponding to the coefficients that 
multiply the grid function values in the numerator of the approximation. In general, the particular 
grid function values, as well as the corresponding weights, that go into the definition of any finite 
difference operator depend on both the order of the differential operator being approximated and 
the level of accuracy (as a function of the mesh spacing) desired in the approximation. App. [B] 
tabulates all the FDAs used in the discretization of our model equations. In addition, App. [C] 
describes a set of Maple procedures we have written with an aim to automate 1) the derivation 
of finite difference operators of any order and 2) the subsequent discretization of PDEs, of any 
differential order, using these operators. The appendix includes several examples that illustrate 
the usage of these procedures. 

We now turn our attention to some additional basic concepts which are very useful in the 
analysis of FDAs. 
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Figure 4.1: Illustration of saniple 3D finite difference grid of the type used throughout this thesis. 
The figure shows a 3D Cartesian grid drawn in perspective. This particular grid is composed of 



5x5x5 grid points covering the domain Xmin < x < x 



max ; 2/min 



< y <yi 



max; ^min 



< Z < Z„ 



The solid circles represent interior grid points, while the hollow squares denote boundary grid 
points. For clarity, the grid spacing in the z direction is drawn out of scale, and we emphasize that 
throughout this manuscript we take = hy — = h. 
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Figure 4.2: Illustration of stencil for a finite difference approximation of the second spatial 
derivative operator. 
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4.1.1 Basic Concepts Related to FDAs 

Consider a system of partial differential equations described abstractly by 

Lu-f = 0, (4.12) 

where, in the most general case, L is a set of m differential operators acting on a vector, u, of m 
unknown dependent functions: u = (ui, . . . , u^)- Each of the Ui is a function of the n independent 
variables (including time, as appropriate); that is, Ui = Ui{xi, . . . , Xn) for i = 1 . . . m. Similarly, / 
is a vector of m prescribed functions — often known as sources — with fi = fi{xi, . . . , Xn) in general. 
For the remainder of this section, and strictly for the sake of presentation, we will specialize to 
the case m = 1 (i.e. for a single unknown function) , while emphasizing that all of the discussion is 
equally valid for the general-m case. 

We can write any finite difference approximation of (j4.12p as 

where the operator is an FDA of the differential operator, L, is the discrete solution and 
are the values of the source function / restricted to the grid points. The superscript h notation 
emphasizes that h is the fundamental control parameter of the discretization, and we observe 
that ()4.13p will in general constitute a set of algebraic equations — possibly nonlinear — for the 
discrete unknowns, . 

In order to quantify how much the discrete solution, , deviates from the continuum solution, 
u, for any specific value of ft,, it is natural to define the solution error, e'', as 

EE u'' - u. (4.14) 

Clearly, in most cases of interest, the solution error will depend on the (finite) value of h. Thus, 
a key element in the analysis of finite difference approximation schemes is the investigation of the 
relation between and h &sh ^ Q. Does the discrete solution approach the continuum one? If so, 
at what rate? Roughly speaking, numerical analysts judge the quality of the FDA by how fast the 
error goes to zero as a function of h. This motivates the introduction of the concept of convergence 
of the approximate solution. The approximation is said to converge if and only if 

lim = u, (4.15) 

or, equivalently, 

lime'^^O. (4.16) 

The order of convergence (not to be confused with the differential order of the system being 
approximated) measures the rate at which the error converges to zero. If 

lim e'' = 0{hP) , (4.17) 

then the discrete solution is said to converge to the continuum one with order p, or to be p-th order 
accurate. For example, if p = 1, the solution is said to converge linearly (first-order accurate), 
while if p = 2, the solution converges quadratically (second-order accurate). 

In principle, the discrete solution, u^, of the algebraic system of equations (|4.13p can be calcu- 
lated exactly (assuming that a solution exists). In the context of numerical computation, "exact" 
typically means "as accurate as the particular representation of floating-point numbers being used 
allows" ; this is often referred to as "at the level of machine precision" . Let us momentarily restrict 
our attention to the case that the system l|4.13p is linear in the unknowns u'^. Then, again in 
principle (and ignoring issues concerning the conditioning of the linear system) , we could employ 
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standard numerical linear algebra software that implements some variant of Gaussian elimination to 
directly compute u'^ to machine precision. Moreover, for the case that (j4.13p is nonlinear in the u^, 
we could employ a so-called global Newton iteration: at each stage this would require the solution 
of a linear system, which again could be achieved via Gaussian elimination. However, especially 
for the algebraic systems that result from the finite difference discretization of time-independent 
PDEs with dependence on multiple spatial variables (e.g. 2D and 3D systems), the "exact" calcu- 
lation of u'' in this manner is often prohibitively expensive computationally. In particular, for such 
problems, the amount of computational work that must be expended to compute the solution per 
unknown tends to increase as the mesh is refined. The reason for this increase in work per unknown 
as h can be traced to the phenomenon of fill in that occurs during the Gaussian elimination 
procedure, which is typically based on the so called LU decomposition, wherein a matrix, A is 
factored as A = LU. where L and U are, respectively, upper- and lower-triangular matrices. Even 
with special orderings of the unknowns (such as that provided by nested dissection [164j ). the 
matrices that result from finite difference discretization of elliptic operators in 2- and 3-D — though 
sparse — have bandwidths that increase as /i — ?> 0. This leads to an increasing density of non-zero 
elements appearing in L and U, which in turn results in the increasing cost per unknown as the 
mesh is refined. 

Given this situation, it often turns out to be more computationally efficient to calculate an 
approximate solution of the linear system arising from an FDA of an elliptic equation (or a lin- 
earization thereof) through an iterative method. In this case, an "exact" solution is only obtained 
in the limit of an infinite number of iterations (assuming the iteration converges), but in practice 
the iteration can be terminated when some convergence criterion is achieved. We note that iter- 
ative techniques form the basis of the multigrid method that we use to solve the elliptic PDEs in 
our model, and which we discuss in Sec. 14.31 Additionally, they are also used directly to solve our 
time-implicit (Crank-Nicholson) discretization of the hyperbolic equations, as is described in more 
detail in Sec. HX^ 

As the name suggests, an iterative method involves the computation of a sequence of approxi- 
mations of the solution unknown, u^, and we will generically denote any of these approximations 
as m''. It should be noted that implicit in the use of an iterative technique is the fact that ■S'' must 
be initialized in some fashion. 

Given any iterate, u^, an important quantity is the residual, r'' , associated with and defined 

by 

r^' = L''u'' - f\ (4.18) 

Thus, the residual quantifies the amount by which m'' fails to satisfy the FDA (|4.13p . and the 
solution of (|4.13p by an iterative process is then equivalent to driving the residual to 0. 

One last quantity that is very useful in the analysis of FDAs is the truncation error, defined by 

t'' = L''u^ (4.19) 

where we note that u is the continuum solution of the differential system (|4.12[) . Note that from 
Eq. ()4.12p we have = Lu, where the right hand side of this last expression is understood to be 
evaluated on the discrete mesh. We can thus rewrite (|4.19p as 

{L''-L)u^t^, (4.20) 

and from this form we see that the truncation error directly measures the deviation between the 
actions of the finite difference and continuum operators on the continuum solution. 

A finite difference approximation is said to be consistent (with the underlying PDF) if and only 
if the truncation error goes to zero as h tends to zero: 



lim t'' = 



(4.21) 
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Clearly, consistency is a necessary condition for convergence of the FDA. Finally, the FDA is said 
to be p-th order accurate if: 

lim t'' = 0{hP), (4.22) 

h—i-O 

and where p is strictly positive integer. 

4.1.2 Richardson Expansions 

We note that for any given FDA, i'*, of some differential operator, L, the functional form of the 
truncation error, r'', can be explicitly computed, typically using Taylor series expansion. On the 
other hand, it is not immediately clear what, if anything, we can say about the functional form of 
the actual solution error, e'' = u'' — u. However, this was, in fact, addressed almost a century ago 
in a landmark paper by L. F. Richardson |165j . Richardson posited (stated without proof) that 
provided a finite difference scheme was 0{h^) and centred, so that the truncation error, t'*, had 
the form 

r'' h^T2{t, X, y, z) + h^Ti{t, x, y, z) + h^Tc{t, x,y,z) + ... (4.23) 
then the solution error, e'', would have a similar expansion 

e'^ = h^e2{t, x, y, z) + /i'*e4(i, x, y, z) + /i^e6(i, x,y,z) + ... (4.24) 

Here, a key observation is that the functions £2, 64, eg, ... appearing in the expansion of the solution 
error have no h-dependence. This seemingly innocuous observation has far-reaching consequences 
for the analysis of the error in finite difference calculations, and we will refer to an asymptotic {h — ^ 
0) expansion of the form (|4.24[) as a Richardson expansion. Note that one immediate consequence 
of ()4.24p — if it is indeed true for the particular second-order, centred FDA under consideration — is 
that an 0{h?) truncation error implies an 0{h?) solution error. 

In addition, we can postulate the existence of Richardson expansions in schemes where non- 
centred difference approximations are employed: in this case, the expansion will include terms 
proportional to , where p can be both even and odd. Similarly in situations where the approx- 
imation is 0{hP) accurate for p > 2 the expansion will begin with a term ep{t,x,y, z), and 
will contain terms that have strictly even powers of h, or even and odd powers of the mesh scale, 
depending on the type of finite differences used. 

In all cases, should a Richardson expansion exist, we can argue that consistency leads to con- 
vergence: 

lim t'* = 0{hP) ^ lim e'' = 0{hP) . (4.25) 

In general it is not feasible to prove the existence of a Richardson expansion for a given FDA of a 
set of PDEs: such a proof will be at least as difficult as proving global existence and uniqueness 
of the PDEs themselves. However, in simple cases where both the PDE and the discretization are 
amenable to closed form analysis it may be possible to provide a proof — see page 111 of reference 
[166] for an example. Most importantly for us, in practice one can always establish the existence of 
the expansion (non-rigourously) by construction, i.e. through examination of the /i-dependence of 
the numerical solutions themselves. In particular, if the discrete solution converges at the expected 
rate as /i — > 0, then one can usually be quite certain that a Richardson expansion exists. We will 
return to this point in Sec. 14.2.11 

4.1.3 The Crank-Nicholson Discretization Scheme 

To conclude this section we consider a specific type of FDA that can be applied to time dependent 
PDEs, and which was used in the finite-differencing of the hyperbolic equations appearing in our 
model. We illustrate the technique using a very simple PDE, namely the one dimensional advection 
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equation. However, as discussed at the end of this sub-section, the method can be gcnerahzed to 
virtuahy any system of PDEs that are first-order in time. 

The ID dimensional advection equation is perhaps the simplest example of a hyperbolic PDE. 
Posing it as an initial-boundary-value problem on a finite domain we have: 

du{t,x) du{t,x) ^ . 

' = - a^min < x < a;„,ax , i>0, (4.26) 

u{0,x) = uo{x), (4.27) 
M(t,a;max) = 0. (4.28) 

where the last condition can be interpreted as meaning that no disturbances are entering the domain 
from the right. Note that there is no boundary condition at •jj — - Since the only characteristics 

in the problem are given hy t — —x + const, so that at a; = Xmin, the single characteristic field is 
strictly outgoing. (Since it is largely irrelevant to the current discussion, and to keep the following 
presentation straightforward, we will not stipulate any details concerning the update of the grid 
function value at a; = a;inin-) 

One rule-of-thumb that is often used when constructing PDAs is to try to keep the stencils 
as centred as possible (i.e. the geometric structure of the stencil is to be kept symmetric about 
the grid point at which the approximation is applied) . Use of centred schemes typically results in 
several benefits, including: 

1. For given accuracy (e.g. 0{h^)), a minimization of the number of unknowns appearing in the 
stencil. 

2. Conversely, for a given number of unknowns in the stencil, a maximization of the accuracy 
of the scheme (e.g. 0{h^) vs 0(h)). 

3. Symmetry of the matrices, L^, that operate on the grid function, and that result from 
the discretization (or a linearization of the FDA). 

Concerning the last point, the properties of symmetric matrices are well known, and can often be 
used to prove existence and uniqueness of solutions of the discrete equations. In addition, stability 
theorems that establish convergence of iterative techniques for solving the linear systems resulting 
from finite-difference approximation are generally easier to establish for the case of symmetric 
matrices. Here and throughout this thesis, then, we use centred difference approximations (in time 
as well as in space) whenever possible. 

Returning to the advection equation, we introduce a discrete domain (i",^^) as follows: 

(4.29) 

l,2,...n„ (4.30) 

(4.31) 

(4.32) 

(4.33) 

where A is known as the Courant factor. Here, and throughout the thesis, we will assume that for 
any specific sequence of calculations in which the mesh resolution is changed, A is held fixed. This 
means that the overall difference scheme will always be characterized by the single mesh spacing, 
h. 

The Crank-Nicholson discretization scheme is a two-level (i.e. involves unknowns at two discrete 
instants of time, t" and t""^^), 0{h?) method, which, as shown in Fig. l4.3[ is centred in both space 
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and time about the fictitious grid point, (i""*"^/^, Xi). One obvious advantage of tliis method is 
that it uses only two time levels. Related to this is the fact that among schemes which are 
0(/i^) accurate in time, the actual magnitudes of the temporal contributions to the truncation 
and solution errors tend to be minimized. Finally, another appealing feature of Crank-Nicholson 
discretization is that it tends to minimize the appearance of instabilities that frequently arise in 
the finite difference approximation of hyperbolic equations. As applied to the advection equation, 

Q © o 

o ^ O t" 



Figure 4.3: Crank-Nicholson Stencil. The fictitious grid point, (t^+'^Z^, Zi), about which the 
scheme is centred, is marked with the x symbol. 



and using the standard 0{h^) approximation of the first spatial derivative, the Crank- Nicholson 
scheme is 



At 



2Ax 



2 < i < Ur - 1 . 



(4.34) 



Here is an O(Ai^) {0{h'^)) time-averaging operator. Its action on any grid function is defined 
by 

Mt< = J (4.35) 



Inserting the above definition into (j4.34p yields the following explicit expression for the Crank- 
Nicholson scheme as applied to the advection equation: 



At 



n+l _ n+1 



2Ax 



These last equations can be rewritten as 



A 



,n+l 



n+l 



A 



n+l 



^i+1 "'i-1 

2Ax 



^i+1 



2 < i < Ut — 1 ■ 



2 < i < - 1 



(4.36) 



(4.37) 



The above equations, supplemented with the boundary condition u^nt^ ^ (that follows from (|4.28p ). 
and some auxiliary condition of the form u"^^ = U{u"'^^ , i = 2 . . . Ux', uf, i ~ 1 . . . rtx), consti- 
tute a linear system for the unknowns u"~^^ , i ~ 1,2, . . . ,nx (provided that U is linear in w""*"^, 
i = 2 . . . Hx) ■ That is, we can write the update in the form 



Au 



n+l 



(4.38) 



where A is an Ux x Ux matrix, and u"+^ and b are length-Tia, column vectors. 

We note that A is not diagonal — that is, there is coupling between the individual advanced- 
time unknowns, u""^^, and hence our discretization is an example of what is known as an implicit 
scheme. (In contrast, an explicit scheme would be one for which it would be possible to write. 



for arbitrary Ux, explicit expressions for the u"'*'^.) Now, assuming that the update for m"^^ is 



n+l 



of the form 



n+l. 



1 . . .Ux), then A is actually a tridiagonal matrix. That is. 



only the elements of its diagonal, as well as the the immediate upper and lower diagonals are non- 
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vanishing. We note in passing that these tridiagonal systems can be solved very efficiently — with 
0{n^) operations — using specialized solvers such as those available in the widely used LAPACK 
linear algebra package [167) . 

The evolution equations used in this dissertation were cast as a system of first order PDEs in 
(t,x, y, z), with each individual PDE assuming the following general form: 

^^^^h^^ = F{x,y,z,u^u,,u^,). (4.39) 
at 

Each evolution equation was then discretized using an 0{h^) Crank-Nicholson scheme. As explained 
above in the context of the advection equation, and emphasizing that all mesh spacings, Ax, Ay, 
Ay and At are 0{h?'), this involves the following approximation for the time derivative: 



du{t, X, y, z) _ - u'ijk _ du{t, x, y, z) 



dt At dt 



n+l/2 



Oih^) . (4.40) 



Again, as was the case for the advection equation, the spatial part of the PDEs are replaced by 
approximations that average between the advanced and current time levels using the time averaging 
operator, fit. Specifically, for a general right hand side, F, we have 

F{x, y, z, u, u„ u,,) « i + Fi;,) = Fix, y, z, u, u^, u,.)^^'^ + 0{h^) , (4.41) 

where F denotes an approximation to F in which Ux and Uxx have been replaced by the usual 0{h?), 
centred finite-difference formulae. The result of the Crank-Nicholson discretization, as applied to 
our system of 4 hyperbolic PDEs, Eqs. 12.1731 [2^1 74[ is a non-linear system of algebraic equations 
to be solved for the advanced-time unknowns, ■ Here we remind the reader that u represents 
any of the hyperbolic variables appearing in our model; that is u is any of 01 , 02 Hi or 112 • 

Fortunately there are efficient iterative methods to solve this type of algebraic system of equa- 
tions. In particular, we chose to apply point-wise iterative Newton-Gauss-Seidel relaxation. We 
also note that the relaxation was applied to each of the 4 hyperbolic unknowns individually (the 
decoupled approach), rather than collectively, which would have involved the solution of a 4 x 4 
system at each grid point. 

We should also remark that discretizations of hyperbolic systems of equations are notorious 
for being susceptible to numerical instability [1691 1170] . However, the implicit Crank-Nicholson 
discretization scheme is well known for being an unconditionally stable scheme — i.e. stable for 
arbitrary values of the Courant factor — for large classes of time dependent problems. Although 
we have no mathematical proof of stability for the scheme applied to our hyperbolic equations, 
we have not encountered any issues with numerical stability in the calculations described in this 
thesis. 



^*The reader can refer to App. [D]for a brief overview of the method or to Varga 11681 for a more comprehensive 
treatment. 
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4.2 Convergence Testing of Finite Difference Solutions 

This section introduces two techniques that we employ to estabhsh that our finite difference solu- 
tions do converge to continuum solutions of the PDEs governing our model. The first technique, 
which is now almost universally employed in numerical relativity work (as well as in other fields) , is 
a basic convergence test that, for fixed initial data, examines the behaviour of the finite difference 
solution as the discretization scale, /i, is varied. In our case, this is done by defining a quantity, 
which we call the convergence factor, Q^{t), and which is easily computed from a series of calcu- 
lations in which successive discretization scales are typically related by a factor of 2. When the 
empirically measured Q^{t) behaves in the expected fashion, we establish that the finite difference 
solution is converging to some continuum solution, but there is then still the possibility that it 
may not actually be a solution of the original PDEs. This could easily happen if the overall finite 
difference approximation was not consistent with the model PDEs, but was consistent with some 
other set of PDEs (an omission or accidental modification of some term in any of the original 
PDEs would generally lead to precisely such a situation). We thus use a second technique, which 
is perhaps not so commonly used in our field, which is termed independent residual evaluation, 
wherein we essentially directly verify that our discrete solutions do approach the desired continuum 
solutions as ft, — )• 0. 



4.2.1 Convergence Factor 



As suggested above, we perform convergence tests in a very straightforward manner: we fix initial 
data, and then calculate numerical solutions using at least three different discretization scales 
(which we will often refer to as levels of discretization) . Successive levels are characterized by mesh 
spacings that arc in a 2 : 1 ratio. Thus, at a minimum — and again emphasizing that we fix the 
initial data, as well as all parameters defining the numerical solution, except for h — we calculate 
^ u^^ and u*'', which arc finite difference solutions generated with mesh spacings h, 2h and 4/i, 
respectively. In addition, and again for the sake of simplicity and convenience, we ensure that each 
grid in the sequence "aligns" with the next coarsest grid. That is, the grid points of the \evel-h 
mesh are a subset of those of the level-2/i mesh, which in turn are a subset of those of the level-4/i 
mesh. We then define our convergence factor, Q''(t), as follows 



(4.42) 



where || • || is any appropriate spatial norm, such as I2 (|| • II2) or loo (|| • ||oo),Ej 

1 1/2 



1 



(4.43) 



it'^lloo = max \u'l\ 

l<i<n 



(4.44) 



and where it is to be understood that the subtraction of individual grid function values implicit 



in the expressions u 



2h 



and u*'^ — u'^ occurs only at the set of grid points common to the two 



meshes. Now, assuming that 

1. our FDA is 0{h?) and completely centred, so that the truncation error, r'*, contains no terms 
proportional to where Po is an odd integer. 



■^^We note that, in contrast to what is often done in numerical analysis, we define the I2 norm here to include a 
normalization by an appropriate power of the number, n, of grid points — i.e. by 1/ s/n. When dealing with functions 
defined on different meshes, this is the natural and convenient approach: for example, grid functions defined on 
meshes with different resolutions will then tend to have approximately the same norm. 
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2. any finite difference solution, u'^ has a Richardson expansion of the form (|4.24|) . 
then the discrete solutions m'', u^'' and w""' can be expanded as 

u'' = u + h'^e2 + h^e4, + ... (4.45) 

u+{2h)^e2 + i2h)'^ei + ... (4.46) 



u 



2h 



■^^ = u+{Ahfe2 + {'ihfei + ... (4.47) 



u 



Substitution of the above expressions into (|4.42p yields 

^ II {u + (4fe)^e2 + (4fe)4e4 + 0{h^)) -{u+ {2hfe2 + [ihfe^ + 0(^6)) || 

||(M+(2/i)2e2 + (2/i)4e4 + 0(/i6))-(u + /i2e2 + /i4e4 + 0(/i6))|| ^ 

||l2/i2e2 + 240/i''e4|| 



||3/i2e2 + 15/i4e4|l 
Thus, as the mesh spacing tends to 0, we have 

||l2/i2e2 + 240/1^6411 12/i2||e2|| 
lim Q^\t) = lim \ — ^ = lim = 4 . 4.49 

Note that in deriving (j4.49p we have assumed that the terms in the (asymptotic) Richardson 
expansion are monotonically decreasing. This will usually be the case when the mesh scale, h, is 
significantly smaller than the typical scale over which u varies. Conversely, if h is of the order of 
the variation scale of u, one cannot reasonably expect that Q^{t) will be close to 4. 

Finally, as we have already mentioned, one needs a minimum of 3 levels of discretization to 
compute Q'^(t). In practice, it is best to use as many levels as possible in testing for convergence. 
As finer and finer discretization scales are used, we expect Q^{t) to get closer and closer to the 
constant 4, and deviations from this anticipated behaviour can be used as a powerful diagnostic 
for detecting subtle mistakes in the FDA (e.g. where certain terms that are relatively small have 
only been discretized to 0{h) accuracy, or perhaps not even consistently with the PDEs). 



4.2.2 Independent Residual Evaluation 

As mentioned previously, although a convergence test of the form detailed above can provide strong 
evidence that a discrete solution is converging to something as /i ^ 0, it does not directly establish 
that the limiting solution satisfies the original set of PDEs. We therefore introduce a second 
technique, known as independent residual evaluation Ell, that aims to remedy this shortcoming. 
Importantly, the method can be used to test the correctness of an arbitrary FDA of an arbitrary 
set of PDEs, as well as the actual computer code that solves the FDA. We feel that is difficult to 
overemphasize the value of using this technique for testing the complete process of discretizing a set 
of PDEs, and then solving for the discrete solutions. Particularly for complicated PDEs in multiple 
dimensions, the PDAs used, and the algorithms needed to solve them can be extremely complex: 
there are thus many places where mistakes can be made, and it will not always be obvious from 
the numerical solutions that something has gone awry. Independent residual evaluation has the 
potential to detect virtually any and all errors that have been made, assuming only the following: 

1. The PDEs that we start from are correct. 

^''Available computational resources tend to be the limiting factor in this respect: for 3+1 dimensional compu- 
tations such as that performed in this thesis, a calculation with mesh size h/2 requires about 2'' = 16 as much 
computer time as one with mesh size h. 

^^This technique was first introduced by Choptuik in his studies of critical phenomena in gravitational col- 
lapse |171| . where its use was crucial to the validation of new and unexpected results. 



85 



2. The discrete solutions, u'', that are computed have Richardson expansions. 

We also note that one of the greatest advantages of independent residual evaluation is that it does 
not require the existence of any exact or previously-computed solution of the PDEs. 

We illustrate the technique by again considering a general set of PDEs written in the form 
introduced in Sec. 14.1.11 El 

Lu~ f = 0. (4.50) 

The continuum equation is discretized as 

L^u^^f^ = 0, (4.51) 

with the presumption being that the discretization is correct. Thus, assuming that is a second 
order approximation of L, we will have 

L'' = L + h^E2 + 0(/i^) , (4.52) 

where E2 is a differential operator, whose form can be explicitly computed, and that will be of 
higher degree than L (e.g. if L involves second derivatives, then E2 will generally involve fourth 
derivatives, given that the FDA is 0[h?)). We now assume that we have computed numerically 
an approximate discrete solution, vP-, and have shown that it is converging at second order to 
some continuum function, u (e.g. so that Q^{t) « 4). We can therefore be confident that u'* has a 
Richardson expansion of the form 

u'' = u + h^e2 + 0{h^). (4.53) 

Furthermore, by construction (i.e. by virtue of the actual numerical calculation), we assert that 
the residual, r'' , associated with u'' has a magnitude less than some convergence tolerance, e, where 
e can generally be much smaller than the typical magnitude of the truncation error, r''. That is, 
we have 

ll^h^/i _ ^ ii^hji ^ g ^ ii^^ii (4 

Relative to the discretization (|4.51|) . then, we have established that we have a convergent 
solution, which limits to u. What we have not established, however, is whether u = u. In order to 
do so, we need to simultaneously establish that is a consistent approximation of L, and that 
our implementation correctly solves the algebraic equations (|4.5ip . 

We thus consider an independent (distinct) discretization of the PDE: 

lh^h_jh^Q (4.55) 

This new discrete operator L'' can be expanded in the same manner as L'' was: 

L'* = L + h^E2 + 0{h^) (4.56) 

where E2 is another higher-order differential operator that will be not be the same as the operator 
E2 appearing in (|4.52p . Note that we assume here that L'' is also an 0{h^) approximation to L, 
but this is not essential (e.g. an 0{h) approximation could also be adopted). Given this second 
discretization, the process of independent residual evaluation consists simply of applying the left 
hand side of (|4.55p to our putative numerical solution, u'\ Defining to be the independent 



^*As in Sec. I4.1.T1 and for clarity of exposition, wo suppress the (t,x,y,z) dependence of continuum functions (n, 
/, T2 etc.) as well as their discrete counterparts {u^ , etc.). In addition, we will proceed by assuming that L 
and any finite difference approximations of L, as well as the operators E2 and E2 appearing in H4.52|l and l)4.56|l . 
respectively, are linear; however, the technique is equally applicable to nonlinear equations. 
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residual we have 

I'' = lS-j'' - = \l + h^E2 + Oih'^)] [u + h^e2 + 0{h^)] - (4.57) 

= Lu^ + {e2U + Lea) + 0{h^) (4.58) 

= Lu-f' + 0{h^). (4.59) 

Here we have again used the (inessential) assumption that L and -E2, are both linear. 

Now, if the computed continuum solution, u is not a solution of the original PDE we will 
generically have u{t^x^y,z) = u{t^x^y,z) + eo(t, x, y, z), where ep = 0(1), and as ft, — >■ wc will 
find 

I'' = V'u'' - f ^Lu- f + Leo + 0(/i2) = Leo + 0(/i^) . (4.60) 

Thus, unless Leo = 0, which is extremely improbable, then as /i — > 0, we will find that will con- 
verge to some non-zero function, given by Leo. In other words we will have = 0{1). Conversely, 
if u and u are the same function, we will have 

= L'^u^ - = Lu-f' + 0{h^) = 0{h^), (4.61) 

with the crucial observation being that as ft ^ 0, i.e. that really is "residual" quantity 

with respect to both the independent discretization, and the continuum PDE. If we compute in 
a typical sequence of calculations aimed establishing convergence (i.e. with all problem parameters, 
save ft-, fixed), and observe behaviour as given by (|4.6ip then we have provided very strong evidence 
that the approximate discrete solution -u'' is converging to the true continuum solution u of the 
PDE. 

In order for the technique of independent residual evaluation to be effective, it is vital that the 
second discretization, L^, is consistent with the original PDE operator L. Otherwise, measurement 
of an which is 0(1) could signal an inconsistency in L'', instead of some problem with the 
principal discretization, , or in the solution of the algebraic equations that result from that 
discretization. In this regard, one should note the following. First, there is no need to solve the 
system (|4.55p for -u'': rather, the independent discretization L'' is simply applied to the computed 
solution u^. Intimately related to this observation is the fact that for the case of time dependent 
systems, one does not need to worry about the stability of L''. In addition, as already noted, there 
is no need for to be of the same order of accuracy as L''. For example, if is 0{h^) accurate, 
but L'* is only 0{h), then if is converging to the true continuum solution, u, we will find that 

is 0(ft) (i.e. still a residual quantity). However, if convergence is to some function u ^ u, then 

will stih be 0(1). 

All of this means that one has lot of latitude in how L'' is constructed, and, perhaps more 
importantly, one can easily use symbolic computing software to generate L from a high-level de- 
scription of the PDEs that is relatively easy to check for correctness. This significantly decreases 
the probability that a that is measured to be 0(1) is due to a mistake in the derivation or 
implementation of L. 

For example, in the construction of the independent residual operator for our set of equations, 
we used 0(ft) forward difference approximations for time derivatives, and 0{h?) forward difference 
approximations for the spatial derivatives!^ This is to be contrasted to the Oih?) approximations 
used for all derivatives in the construction of the basic difference scheme itself (Crank-Nicholson 
for the time derivatives, standard O(ft^) centred approximations for the spatial derivatives). In 
addition, the Maple procedure described in Add. [Cl was used to generate the actual code to evaluate 
The results from our application of independent residual evaluation to scries of calculations 
for several distinct initial data sets are described in the next chapter, along with the results from 



29por grid points close to the outer boundary of the computational domain where the forward difference formulae 
could not be applied, we used 0{h^) backwards difference expressions. 
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4.3 Multigrid Techniques 
4.3.1 Introduction 

As stated in the previous section, iterative techniques such as point-wise Newton-Gauss-Seidel 
(NGS), provide efficient methods for soiving the systems of noniincar algebraic equations that result 
from our time-implicit discretization of the hyperbolic PDEs of our model (those that govern the 
scalar field) . Specifically, we can generally expect convergence to the level of intrinsic discretization 
error using a number of sweeps through the mesh which is independent of the mesh scale, h. Thus, 
in the absence of any elliptic PDEs, the operation count for solving the hyperbolics would be linear 
in the number of discrete unknowns. 

However, as is well known, relaxation methods — even when they are accelerated using the 
successive-overrelaxation (SOR) technique — are not very efficient for solving the systems that result 
from finite-difference discretization of elliptic PDEs. In particular, the amount of computational 
work per unknown needed to solve such systems generally increases as h 0. 

For example, consider what one might call the Poisson equation in one spatial dimension: 

uix)^. = fix) , (4.62) 

and, neglecting boundary conditions, discretize this equation on a uniform mesh using the usual 
centred, 0{h'^) FDA of u^x- 

fi=0- (4.63) 

As described in App. [Dl assuming that the NGS iteration visits the grid points in so-called lexico- 
graphic order (i.e. i = 2, 3, • • • rix ^ 1), the i-th component of the running residual vector is given 
by 

= ^"^ -f.. (4.64) 

Each relaxation sweep (iteration) is supposed to bring the approximate solution u^'^'' closer to the 
exact discrete solution Ui. The convergence of this process can also be viewed in terms of driving 
the running residuals to 0. In fact, in the analysis of the convergence of relaxation methods |168j . 
the effect of the iteration on the running residual vector is often described in terms of the action 
of the residual amplification matrix, A. Assuming a linear set of discrete equations this matrix 
maps the residual vector at iteration k, r^*"'', to the corresponding vector at iteration k + 1, rC^'+i); 

j.(fe+i) ^ ArC^). (4.65) 

Clearly, A must be a contraction map in order for the iteration to converge, and, ideally, the 
spectral radius, p, of the amplification matrix would always be bounded away from unity — i.e. 
p(A) < 1 — to ensure rapid convergence. Unfortunately, for the case of NGS applied to the simple, 
but representative, set of equations defined by (|4.62p . this is not the case. Indeed, it can be shown 
that the spectral radius of the amplification matrix, Angs satisfies 

lim p(Angs) = 1 - 0{h^) . (4.66) 

Operationally, this means that one must perform 0{h^^) = 0{n'^) relaxation sweeps in order to 
achieve convergence, so that, for ID problems, the overall operation count is O(ni^), rather than the 



^''Indeed, as noted by Teukolsky 11721 . for many purely hyperbolic systems of PDEs, one can expect the so-called 
"iterative Crank-Nicholson" method to converge to the level of the solution error in precisely 2 iterations. Howrever, 
in our case, due to the additional coupling with the elliptic equations, this observation does not apply. 

Again, the analysis can be extended to the nonlinear case, when a linearization method such as Newton iteration 
is used. 
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optimal 0{nx). Furthermore, the type of behaviour given by (|4.66l) is seen when NGS is apphed 
to virtuahy any finite differenced elliptic system, in any number of spatial dimensions. Although 
the use of overrelaxation (SOR) can significantly improve convergence, one still generally has 

lim p(Asor) = f - Oih) , (4.67) 

and the work per unknown needed to achieve convergence still increases as the mesh is refined. El In 
short, especially for cases such as ours, where an elliptic system must be solved at each discrete time 
step, relaxation techniques do not provide a viable route for solving discretized elliptic equations. 

On the other hand — and again using the simple toy problem (j4.62p for illustrative purposes — 
by expanding the residual vector in terms of the eigenvectors of the amplification matrix, it is 
possible to show that the asymptotic convergence rate (|4.66p is dominated by the eigenvectors with 
wavelengths that are long compared to the mesh scale, h. That is, each relaxation sweep damps 
long-wavelength components of the running residual by factors given by the associated eigenvalues, 
cr'°"^, where crj°"^ = 1 — 0{h?). Conversely, for short-wavelength modes — which we will define 
as modes having wavelengths. A;, in the range 2h < \i < Ah — one finds that the corresponding 
eigenvalues are bounded away from unity, and, importantly, are independent of h. In other words, 
as the amplification matrix is repeatedly applied to the residual vector, the short wavelength (high 
frequency) components of the residual are rapidly annihilated, while the long wavelength (low 
frequency) components are very slowly damped. In short, relaxation methods such as Gauss-Seidel 
tend to be very good smoothers when applied to finite difference discretizations of elliptic PDEs, and 
that this smoothing property applies both to the (running) residual vector, r'^'^^ and the deviation 
between the approximate solution after the fc-th relaxation sweep, u'^, and the exact solution, u'' 
of the discrete equations. 

Fortunately there is an extremely efficient methodology for solving finite difference approxima- 
tions of elliptic equations. This is the multigrid technique |351 1149] . which was largely developed 
by A. Brandt in the late 1970's and early 1980's. In terms of computational complexity, the most 
striking feature of multigrid is that, in many cases, it is able to provide solutions of the discrete 
equations with 0(N) computational work and 0{N) storage, where N is the total number of dis- 
crete unknowns, o Importantly, this computationally-optimal performance can be achieved for 
quite general nonlinear systems of elliptic PDEs, such as the one encountered in our model. It 
should be noted that from the fact that multigrid can provide solutions with 0{N) work one can 
immediately deduce that its convergence rate in such instances must be /i- independent. 

The classic multigrid method (i.e. the approach due to Brandt) builds on two fundamental 
observations. The first of these we have just made: relaxation methods, such as (Newton)-Gauss- 
Seidel, can be used to efficiently smooth both the residuals and solution errors on any particular 
grid. In multigrid then, one uses relaxation not to solve the discrete set of equations, but only to 
smooth the residuals and solution errors. Efl 

The second observation is that once the discrete elliptic problem has been smoothed, it can be 
well represented on an coarser grid, having, for example, a discretization scale 2h. Especially for 
multidimensional equations, the work need to solve the coarse-grid problem will be a fraction of that 
required for the fine-grid solution. Moreover, we can then apply these ideas recursively: relaxation 
on the 2h grid quickly annihilates modes in the residuals and errors that are high-frequency (short- 

Furthermore, one can only attain ||4.67|I when the choice of relaxation parameter is precisely optimal, and 
determining an optimal value is non-trivial in practice, particularly for coupled nonlinear elliptic systems such as 
ours. 

^^The multigrid method is itself iterative, so by solving the discrote equations we again mean "to the level of 
the intrinsic solution error, — «" , where u is the continuum solution and v}^ is the exact solution of the discrete 
system. 

Again, when we speak of "solution errors" in the context of an iterative method such as multigrid, wo generally 
mean the errors in the approximate solution, , at any stage of the iteration, relative to the exact solution, , of 
the difference equations. However, assuming that the true solution error, u'' — « is smooth, smoothing — is 
clearly equivalent to smoothing u'^ — u. 
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wavelength) with respect to 2h, and then we can pose a version of the problem on an even coarser 
grid having mesh spacing, Ah, and so on. Eventually, this smoothing and coarsening process on 
grids with ever increasing mesh spacings leads to a grid with so few points that solution of the 
discrete equations using relaxation (or even a direct technique) requires a negligible amount of 
computational work. The process of working on coarser and coarser grids is then reversed: smooth 
corrections to grid functions on finer grids are interpolated from the coarse grids. This process of 
interpolation introduces new high frequency error components in the fine-grid functions, but these 
can be effectively damped with a few more relaxation sweeps. Once the algorithm returns to the 
original, finest mesh, the relaxation sweeps performed after the interpolation of the correction from 
the second-finest mesh typically results in a solution in which the error has been reduced by a 
substantial amount. 

In the next subsection we will discuss these basic ideas in somewhat more detail for the case of 
a particular multigrid method, known as the Full Approximation Storage (FAS) scheme, which is 
suitable for the treatment of nonlinear problems. 

4.3.2 The FAS Algorithm 

Again adopting a notation in which the dependence of functions on the relevant set of independent 
variables (e.g. {x,y,z)) is implicit, we write a general nonlinear system of elliptic PDEs as 

N[u] = f. (4.68) 

Here N denotes the set of nonlinear differential operators acting on the solution vector, m, while / 
is a vector of source functions which is independent of u or any of its spatial derivatives. We then 
consider some finite difference approximation of ()4.68|) which, as usual, is characterized by a single 
discretization scale h. This yields a nonlinear set of algebraic equations that we write as 

N^[u^]=f'. (4.69) 

The solution of (|4.69|) is expected to be computed through some iterative process. Each step of 
the iteration defines the residual vector, r'', corresponding to the current approximation, -u'', of the 
discrete solution, u'^: 

r'' = N''[u''] - f'' . (4.70) 

The goal of the iterative process is to drive the residual to zero or, equivalently, to drive the solution 
error, w'', to zero, where v'^ is defined by 

u'' = il'' + v'' . (4.71) 

As mentioned in the last section, relaxation methods such as Newton- Gauss- Seidel tend to be 
excellent smoothers. The FAS multigrid method takes advantage of this fact to (implicitly) smooth 
the correction on each member of a hierarchy of grids. Each grid in the hierarchy is labelled 
by a parameter I (the level parameter), where Z = 1 is the coarsest level and I = Imax is the finest. 
From considerations of both total computational cost, as well as ease of implementation, the grids 
in the hierarchy usually satisfy = hi/2, and we have adopted this choice in our current work. 
Starting on the finest mesh, i.e. on level Imax, we perform a few relaxation sweeps using a method 
such as NGS. Iff] Provided that these sweeps do smooth both r'* and v'^ on the fine grid, we can 
then proceed to define a coarse grid problem. This is done by setting up an appropriate equation 
for the correction (solution error), u'', on the coarse grid. 

In order to define this equation, first note that if N'^ were linear, we could apply it to both 

^^We note that, as discussed in Sec. 12.4.51 point-wise Ncwton-Gauss-Scidcl is not always an effective smoother 
for elliptic discretizations, but it is for our system, provided that we work in the standard (i.e. non-compactified) 
Cartesian coordinates. 
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sides of (j4.7ip to get 

N''[u''] = N''[u'' + v^] = N''[u''] + 7V'*[w''] = r'^ + + N''[v^] . (4.72) 

Using (|4.69|1 this could be further simphfied to 

N^[v''] -r'\ (4.73) 

This last equation actually forms the basis of a multigrid algorithm known as the linear correction 
scheme (LCS). However, since N'^ is a nonlinear operator, we cannot proceed along this route. 

Nonetheless, Eq. (|4.73p does suggest a similar treatment for the nonlinear case. Subtracting the 
definition of the residual, Eq. (j4.70p . from the basic difference equation (|4.69p yields 

N''[u''] - N^[u''] = N''[u^- + v'''] - N'''[u^'] = -r'' . (4.74) 

Thus, rather than having an explicit equation for the correction w'', (|4.74p is to be viewed as an 
equation for the "full approximation", = +v^, and hence the name "Full Approximation 
Scheme" . 

From the smoothing assumption, all terms appearing in ()4.74|) are smooth on the scale, h, of 
the finer grid. One can then sensibly pose a coarse grid form of (|4.74p as follows: 

^2hj^2h] _ ^2hjj2h~ft] ^ -/f r'\ (4.75) 

Here, u^^ is the unknown which is to be computed on the coarse grid (note that it does not, in 
general, satisfy N'^^^v?^ = P^), and if^ — known as a restriction operator — transfers a fine grid 
function to the coarse grid. Once u^'' has been determined by solving ()4.75p . the fine grid unknown, 
, is updated using 

u^=u''' + I^^ {u^'' - If^u^) . (4.76) 

Here, — known as a prolongation operator transfers a coarse grid function to the fine grid. 
In practice, /j/i generally performs polynomial interpolation of an order that can depend on: the 
differential order of the elliptic system, the order of accuracy of the FDA, and the specific smoother 
being used. As Brandt [35] emphasizes, Eq. ()4.76|) is to be preferred over the more obvious 

u'^ = I',\u''^ (4.77) 

since the former retains (useful) high frequency information already computed in u'', whereas the 
latter does not. 

Eqs. (|4.75p and (|4.76p constitute the core of the FAS algorithm. Our quick derivation of the 
FAS scheme also corresponds to a specific point of view, wherein the multigrid method is seen as 
a solver that uses a hierarchy of coarser grids to accelerate the convergence of error components 
which have long wavelengths on the fine grid. 

Again as stressed by Brandt j25] ■ there is a useful "dual" interpretation of the FAS algorithm in 
which the fine grids are used to provide correction terms to coarse grid systems, essentially allowing 
unknowns on coarse grids to be determined to the same accuracy — relative to the continuum 
solution — as the unknowns on the finest grid. It is instructive to quickly work through the alternate 
derivation of (|4.75p from this vantage point, and to do so we must introduce the concept of relative 
truncation error. 

Recall from Sec l4.1.l1 Eq. (|4.19p . that the truncation error, r'', of a finite difference scheme is 
defined in terms of the action of the discrete operator on the continuum solution: 

T^' EE N''[u] - p . (4.78) 



For the purposes of the current development, it is convenient to use the FDA (|4.69p in the form 
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= [u^] to rewrite the above equation as 

^ N^[u]- N^[u'']. (4.79) 

Now, if it was possible to know the exact value of the truncation error in advance, then by adding 
it to the right hand side of (|4.69p we would have an equation 

N^[u^] = + T*" , (4.80) 

whose solution, u'', would be identical to the restriction of the continuum solution, u, of the 
PDE (|4.68p to the mesh points. Unfortunately a priori knowledge of is, of course, equivalent to 
a priori knowledge of u, so at first glance, this observation does not seem very useful. 

However, suppose that it is possible to compute some approximation, f'' of . Then, by adding 

to the right hand side of Eq. (|4.69p , we would get an equation 

N^'l] = f'' + , (4.81) 

for a new grid function, u^, which should be more accurate than the solution of (|4.69p . That 
is, we should find — u\\ < — where || • || denotes some norm. 

In analogy to the definition of the truncation error, r'', we now define the relative truncation 
error, r^'', which involves quantities defined on two adjacent levels within the multigrid hierarchy, 
having discretization scales h and 2h respectively: 



2h 
h 



N'^^[ll^u^] - /f {N^'[u^]) . (4.82) 



Again, ifj^ is a fine-to-coarse transfer operator (restriction operator). Once more using Eq. ()4.69p 
in the form = iV'*[u'*], and assuming that if/^f^ = this definition can be rewritten as 

j^2h^^2h^^ j^2h ^^2h^ (4.83) 

Thus, completely parallelling our previous interpretation of t^, the relative truncation error, r^'', 
can be viewed as the correction that must be added to the source term of the coarse grid difference 
equations in order that the coarse grid solution actually coincide with the (restricted) fine grid 
solution. Once more, we are unable to compute r^'* precisely unless we have the exact solutions, 
and u^^, of the two discrete systems in hand. However, during the multigrid solution process, 
we can certainly calculate an approximation, f^'*, of t'^^ using the current estimate, u^, of the fine 
grid unknown: 

~2h ^ N2h [j2h^h^ _ j2h (4 84) 

Then, replacing rf^ with f^'* in (j4.83p . we have derived the "dual" equation for the coarse grid 
unknown, v?^ (i.e. the full approximation): 

j^2h^^2h^^ ^ j2h ~2h ^ (4.85) 

Now from (|4.75p . and again assuming that I^*^ ~ f^^ and that /^'* is linear (which in practice 
it invariably is), we have 

^2ftj^2h] ^ j^2h^j2h^h^_ j2h^h (4^gg) 

= N^''[ll^'u'']- ll'' {N^u^ ~ f^) (4.87) 

= f^' + rl'' (4.88) 

which is precisely Eq. (|4.85|) . Thus the two viewpoints lead to the same coarse grid correction 
equations, and therefore are indeed equivalent. 
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As summarized in Sec. 14.3.11 the FAS coarse grid correction equations arc applied recursively, 
with relaxation sweeps on level I being followed by the initiation of a coarse grid correction on level 
I — 1. When the coarsest grid (l ~ 1) is reached, u^^ can generally be computed very inexpensively 
using relaxation or perhaps some other method (such as a direct solution of the difference equations, 
using a global Newton iteration). Once u'^^ has been calculated, another sequence — consisting of 
fine grid function updates (using prolongation) followed by additional smoothing sweeps — is per- 
formed for I = 2,..., /max This entire process of "working down" the hierarchy from the finest 
to coarsest grid, then "back up" to the finest grid, is known as a V-cyc\e, due to the V-shape 
that results from a standard pictorial representation of the algorithm in which the vertical direc- 
tion encodes the discretization level (with h increasing downwards) , while horizontal displacement 
represents successive stages of the procedure. A pseudo-code version of the basic FAS algorithm 
described above, and which was used in this thesis, is shown in Fig. 14.41 

To conclude this chapter we reiterate that the elliptic PDEs appearing in our model, Eqs. l2.173l - 
I2.185l wcrc discretized using a centred 0{h^) FDA (see App.|B]for the specific difference operators 
used). In addition, we used red-black ordering for the NGS relaxation sweeps, full-weighted re- 
striction for /^'* and trilinear interpolation for . The interested reader is directed to 1149] 
for further discussion of these technicalities. Finally, although we did experiment with collective 
relaxation, due to an original concern that the mixed second derivatives in some of the equations 
might be problematic, we ultimately found that the decoupled approach produced faster running 
code. 
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subroutine MG_update() 
cycle = 1 

do while (residual > tolerance) 

call vcycledmax, cycle, p,q) 

cycle = cycle + 1 
end do 
end subroutine 

subroutine vcycle (Imax , cycle ,p , q) 

do 1 = Imax to 1 ( cycle from fine to coarse levels ) 
if cycle = 1 or 1 not equal to Imax then 
repeat p times: 

perform a NGS relaxation sweep on level(h[l]): 
u[l] = relax_rb_NGS(u[l] ,f [1] ,h[l] ) 

restrict grid functions to level(h[l-l] ) : 
u[l-l] = I [1-1] [1] u[l] 

compute the relative truncation error of the solution 
on level (h [1-1] ) : 

tau[l-l][l] = N[l-1] 1[1-1][1] u[l] - I[l-l][l] N[l] u[l] 

compute the new RHS vector for level(h[l-l] ) , by adding the 
restricted RHS to the truncation error: 
f[l-l] = tau[l-l][l] + I[l-l][l] f[l] 
end if 
end do 

solve the system of PDAs on level (h[l]) exactly: 
do while (residual > coarsest grid tolerance) 

u[l] = relax_rb_NGS(u[l] ,f [1] ,h[l]) 
end do 

do 1 = 2 to Imax ( cycle from coarse to fine levels ) 
compute the CGC from level (h [1-1] ) to level (h[l]) 

and apply the CGC to unknown variables at level (h[l]): 
u[l] = u[l] + 1[1][1-1] ( u[l-l] - 1[1-1][1] u[l] ) 

repeat q times : 

perform a relaxation sweep on level(h[l]): 
u[l] = relax_rb_NGS(u[l] ,f [1] ,h[l] ) 

end do 
end subroutine 



Figure 4.4: A pseudo-code representation of the FAS V^-cycle multigrid algorithm used in this 
thesis. 
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Chapter 5 

Code Validation and Results 



This chapter presents some results from the numerical solution of the system of PDEs derived in 
Chap. [2] and summarized in Sec. 12.51 The results discussed here are restricted to the case of boson 
stars having a potential that only includes a mass term, so that = m^|0p. As discussed 

in Chap. [31 the mass parameter m can be chosen arbitrarily as part of our overall specification of 
a system of units, and we have performed all of our calculations with m = 1. 

Sec. lS.ll provides some details concerning our numerical code as well as the nature of the various 
initial data configurations that we have considered. This is followed by three sections devoted to 
validation and error analysis of the code. Specifically, we report the results of convergence tests and 
independent residual evaluation, as defined and discussed in Ch. IH for representative calculations 
involving: 

1. A generic initial configuration of the scalar field (Sec. 15. 2|) . 

2. Single boson stars at rest in the computational domain (Sec. [573]) . 

3. A single boson star moving through the computational domain (Sec. 15.4")) . 

The overall success of these convergence tests — most notably the convergence of the independent 
residuals — is considered the strongest evidence we have that our code does correctly compute 
solutions of our model that converge to the continuum limit as the basic mesh scale, h, approaches 
0. In addition, we provide evidence that our numerical results conserve ADM mass and Noether 
charge to the expected order of accuracy. 

The final two sections of the chapter focus on the dynamics of two boson stars, and the calcu- 
lations discussed therein constitute the major new computational results of this thesis. Sec. 15.51 is 
concerned with the simulation of a head-on collision between two boson stars, in which we observe 
"solitonic" behaviour that has previously been seen in other studies of sclf-gravitating scalar field 
configurations |1731 1174] 11281 [130] . Sec. 15.61 then considers the interesting case of the simulation of 
boson star binaries for three different choices of initial data parameters, each of which produces a 
distinct end state: 

1. Long-lived orbital motion. 

2. Merger of the stars that results in a conjectured rotating and pulsating boson star. 

3. Merger of the stars that leads to the conjectured formation of a black hole. 

Finally, Sec . 15 . TI summarizes our main results and outlines some possible directions for improvements 
and additional developments of this project. 

5.1 Summary of the Numerical Code 

The set of PDEs governing our model — as derived and discussed in Ch. [2] — -was discrctized using 
the finite difference techniques described in Ch. [3| Specifically, an 0{h?) Crank- Nicholson scheme 
was applied to: 

^^As mentioned in Chap. [2] these configurations are sometimes referred to as mini boson stars, since for any 
plausible particle mass, the total gravitating mass of the star is very small. Consequently, such objects arc thought 
to be highly unlikely to be of any astrophysical significance, especially as dark matter candidates. 



96 



1. The 2 first-order- in-time equations that govern the time evohition of the real and imaginary 
components of the complex scalar field, (j)i{t,x,y, z) and (j)2{t,x,y, z), respectively. 

2. The 2 first-order-in-time equations for the corresponding conjugate momenta, Ili(t,x,y, z) 
and 112 (i, x, y, z). 

Additionally, 0(h?) centred FDAs were used to discrctize the following elliptic PDEs appearing in 
the model: 

1. The slicing condition for the lapse function, a{t, x, y, z). 

2. The Hamiltonian condition for the conformal factor, ip{t,x,y, z). 

3. The momentum constraints for the three components of the shift vector, (t, x, y , z) , (3^ {t,x,y, z) 
and /3^(t, x, y, z). 

We reemphasize that in generating the results described below, Dirichlet boundary conditions were 
imposed on all variables (see Sec. I2.4.5|) . 

The numerical code that was written to solve the finite difference equations (FDEs) originating 
from the discretization sketched above consisted of separate components for the hyperbolic and 
elliptic variables. For the hyperbolic unknowns, the Crank-Nicholson FDEs were generated and 
solved using RNPL (Rapid Numerical Prototyping Language) |175[ 1176] . RNPL takes a high-level 
specification of difference equations written in a natural operator form, and produces routines 
that employ point-wise Newton- Gauss- Seidel iteration to compute advanced-time unknowns. As 
discussed in Chap. |4] this iteration typically converges rapidly for time-implicit discretizations of 
wave equations, and our experience was consistent with that observation. For the case of the FDEs 
governing the discrete elliptic unknowns we wrote FORTRAN 77 routines that implemented an 
FAS multigrid algorithm, as also described in the previous chapter. 

A pseudo-code description of the overall code flow is given in Fig. 15.11 Execution of the program 
begins with a call to the initial data solver denoted IVP_solver in the figure. W\ In turn, this solver 
makes use of a set of routines that generate solutions representing the static spherically symmetric 
boson stars that were discussed in detail in Chap. [3] The interested reader is referred to App. |X] 
for documentation of the highest level routine, bsidpa, that can be used to determine spherically 
symmetric boson star profiles for arbitrary polynomial self-interaction potentials. 

The solver for a single boson star returns a set of spherically symmetric functions, including 
a(i?), a{R) (j){R) and ^{R), where R is areal radius. These functions must then be transformed 
to the isotropic radial coordinate, r, and interpolated to the Cartesian computational domain as 
described in Sees. 13.41 and 13. 6[ respectively. In addition, if the star is to be boosted at the initial 
time, the transformations detailed in Sec. 13.71 are applied. 

In instances where the evolution is to describe the dynamics of a boson star binary, the above 
process is carried out for each star, and the resulting functions from the computations for each indi- 
vidual star are added to produce the data for the binary. Although not necessary mathematically — 
since the initial values of the scalar field variables are essentially unconstrained, other than require- 
ments of smoothness and sufficiently rapid decay as r — >■ oo — when we set up binaries in this 
manner we try to ensure that the stars are sufficiently well separated that the initial configuration 
really does describe two isolated objects. 

Once the scalar field variables have been fixed at t = 0, the multigrid solver is invoked to 
determine the initial values of the elliptic variables, a(0, x, y, z), ^p{0,x,y, z) and /3^(0, x, y, z). 
Here we note that P^{0, x, y, z) is only non-zero when a boost has been applied (either to a single 
star, or to both stars in the binary case). Furthermore, since we only apply boosts in the x direction 

^''We note that our code can handle essentially arbitrary initial configurations for the scalar field. Here we focus 
discussion on the case of most interest, in which the initial data represents one or more boson stars. 

^® Refer to Sec. 13.71 for details concerning the computation of the conjugate momenta, Hi and 112, from the 
spherically symmetric solution for a single boson star. 
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we always have /3^(0, x, y, z) = /3^(0, x, y, z) = 0. In performing the multigrid solution we start 
from initial estimates given by the values of the geometric variables computed from the boson star 
solver — and post-processed in the manner described in Sec. 13.71 

Once the initial data has been determined, the code enters the main time-stepping loop. After 
initialization of the advanced values f (t+dt) and g(t+dt) to those from the previous step f (t) 
and g(t), each time step proceeds by a sub-iteration in which: 

1. The advanced values, g(t+dt) . of the geometry variables are updated using a single multigrid 
FAS y-cycle with the scalar field values f (t+dt) acting as sources. 

2. The advanced values, f (t+dt) , of the scalar field variables are updated using a single point- 
wise Newton-Gauss-Seidel relaxation sweep, with the values g(t), g(t+dt) and f (t) acting 
as sources. 

3. The residuals of the finite difference equations for the advanced values of the scalar field are 
computed. 

This sub-iteration continues until the I2 norm of the scalar field residuals is below some specified tol- 
erance, which was set to 10~^ for the calculations described below. Once the sub-iteration has con- 
verged, the advanced (t+dt) values are relabelled as current values (pseudo-routine swap_levels). 
Invocation of the analysis routine then effects calculation of quantities such as the independent 
residuals, the ADM mass and Noether charges, as well as the periodic output of grid function values 
to disk. This completes one pass of the main loop: the time-stepping procedure is then repeated 
until the specified final integration time, <max, is reached. 

In order to facilitate the presentation of the numerical results that follow, we now introduce 
some convenient nomenclature and notation. Recalling that we use a single, uniform grid (mesh) to 
perform our computations, the actual number of grid points associated with any of our simulations 
can be represented by a vector we call the shape of the mesh, shape is defined by 

sha.j>e ^ [N^, Ny, N,], (5.1) 

where iV^;, Ny and are the number of grid points in the x, y and z, coordinate directions, 
respectively. The total number of grid points in the computational domain is thus simply given by 
NxNyNz- In addition, when performing computations on a series of meshes with grid spacings in 
which each finer scale is 1/2 that of the previous grid, it is useful to introduce the notion of the 
discretization level. Let = miii{Nx, Ny, Nz). Then we demand that there be an integer I such 
that 

AT = 2' + 1 , (5.2) 

where I is precisely what we call the level of the calculation. In many cases we run our code with 
N,j. = Ny = Nz- In instances where this is not true, then tacit in this definition is the assumption 
that we will compute with some minimum level Zmin, with corresponding A^min = 2'"'" -I- 1. Further 
assuming that A^ = N^, the coarsest grid will then be characterized by 

shape;_^._^ [A^niin, A^yo, Nzo] , (5.3) 

with A^j,o > A^, Nzo > N. Finer grids will then have 



shape,^^^_^+, = [2'(A^ - 1) + 1, 2*(A^yo - 1) + 1, 2'{Nzo - 1) + 1] 



(5.4) 
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f(t) = scalar field variables at time t 
g(t) — geometry variables at time t 




t = 

rail TVP =!o1 vpr ff ft crft")! 




do while ("t < "tmax) 




rail ■in-i-t--ia1 cmp<5<5 ff r-t- 'I — > f Ct+dt^ a-Ct^ — > cr 


\ U T^U. U y J 


do whiXe (residuEiX [f ("t+dt)! ^ "toXe]rance) 




caXX MG_update [source = f(t+dt); soXve for 


g(t+dt)] 


caXX CN_update [source = g(t) ,g(t+dt) ,f (t) ; 


soXve for f(t+dt)] 


caXX evaXuate_residuaX [f (t+dt)] 




end do 




caXX swap_XeveXs [f (t) <— > f(t+dt); g(t) <— > 


g(t+dt)] 


t = t + dt 




caXX anaXysis [g(t) ,g(t+dt) ,f (t) ,f (t+dt)] 




end do 





Figure 5.1: A pseudo-code representation of the numerical code. See the text for details concern- 
ing the overall program flow. 



99 

Another definition concerns the coordinates spanned by a particular grid; i.e. the hmits of the 
computational domain. As discussed in 14. II this domain is given by 

^min — *^max ^ 

Vmin — y — 2/max : 

We thus introduce the notion of a bounding box vector, denoted by bbox, and defined by 

bbox — [Xniiri: Xniax: ymin^ Vraax^ ^min: ^max 

We conclude this section with some brief remarks concerning the computing requirements for 
our calculations, as well as the nature of some of the figures we use to illustrate key results. 

First, the simulations presented here were run on single nodes of one of two clusters located 
at UBC, which are known as vnf e4 and vnf e5. Each node of vnf e5 has four 2.4 GHz Dual-Core 
AMD(R) Opteron(TM) processors with a total of 4 GB of memory. Nodes on vnf e4 have two 2.4 
GHz Intel(R) Xeon(TM) CPUs and a total of 2 GB of memory. The most extensive calculations 
were run on vnf e5, lasted approximately 260 hours, and used about 700 MB of memory. Larger 
simulations (up to 1 GB), although possible, were deemed impractical in terms of providing results 
on the timescale of a few days at most. 

Second, many of the figures that follow are surface plots of functions, and for these plots the 
following should be noted: 

1. The "grid lines" in each direction — which are included as a visual aid — are generally only a 
subset of the total that are available: i.e. the number of grid lines does not reflect the true 
resolution of the computation. 

2. Unless otherwise stated, the vertical displacement of the surface corresponds to the value of 
the function being plotted. 



(5.5) 
(5.6) 
(5.7) 



5.8 
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5.2 Generic Initial Data 

A straightforward way to test our code for convergence and consistency is to choose suitably 
generic initial data which can be specified in closed form, and then evaluate the results of its 
time evolution. The initial configuration for the results discussed in this section was based on the 
following generalized gaussian profile for one component of the complex scalar field: 



Here, (j>o, xq, yo, zq, S^, Sy and 6z are adjustable parameters. The freely specifiable variables 
02, III, 112} were then initialized as follows: 

• 0i(O,a;, y,z) = cj){x,y,z), with (2:0,2/0,20) = (4,3,2), 00 = 0.05 and {5.j,,5y,5z) = (4.5,4,4), 

• n2(0,.T,y,z) = -30i(O,a:,y,z) , 

• 02(0, X, y, z) = ni(0, x, y, z) = . 

This produces a off-origin, slightly non-spherical, gaussian lump of field that exhibits non-trivial 
dynamics, but whose evolution also mimics, to some extent, that of a distorted stable boson star. 
We would like to emphasize that the presence of symmetries in the initial data could lead to a 
failure in detecting certain implementation errors. Thus, in particular, we designed this initial 
configuration so that it did not have any of the reflection symmetries x — > — x, y — >■ —y, and 
z — > —z. We should note, however, that the other initial datasets we subsequently describe do 
have at least one of these symmetries, and thus in principle, should lead to time evolutions that 
also possess the symmetry. |f3 We are aware that we could have exploited this fact to reduce the 
cost of many of our computations. Nonetheless, we did not pursue this option since we wanted 
to keep our code general, and also wanted to avoid the complications of implementing symmetry 
conditions in the multigrid solver at this stage. 

The bounding box for the experiments described here was bbox = [—11, 19, —12, 18, —13, 17], 
and three different grids with resolutions in a 1 : 2 : 4 ratio were used for the convergence tests 
and independent residual evaluations. The grid with the finest resolution (level 7), had shape = 
[129,129,129], while the coarsest grid (level 5) had shape = [33,33,33]. The Courant factor 
adopted here — and for all simulations discussed in this chapter — was A = 0.4. 

Fig. 15.21 shows a time-series of z = 2 cuts of the scalar field from a short evolution (roughly one 
dynamical time) of the initial configuration defined above, and where the finest resolution grid was 
used. Here, and for other plots of this type shown subsequently, evolution proceeds left-to-right 
and top-to-bottom. The solution oscillates, but is apparently gravitationally bound, as there is no 
evidence for dispersal of the scalar field. However, we are not so much interested in the evolution 
of this data for its physical content, as for what it can tell us about the convergence properties of 
our code. 

In that regard, we first note that all grid functions remain quite smooth during the calculations. 
Second, as can be seen in the left panels of Figs. 15. 3| 15.41 and [5.5| the convergence factors Q'^{t) 
(defined by I4.42p for the dynamic variables provide clear evidence of second-order convergence. 
That is, Q^{t) for all of the variables remains close to the value 4 that is expected for our second- 
order scheme in the limit that the mesh spacings go to (according to I4.49P . Third, and perhaps 
most importantly, the right panels of these figures show that the independent residuals are also 
converging. Here we note that in order to more easily assess the convergence rates of the indepen- 
dent residuals from the graphs, we have rescaled them so that, should they be converging at the 




(5.9) 



^^This assumes that there are no dynamically unstable modes in the configurations considered that would break 
the symmetries, and that could be excited by truncation error and/or roundoff error effects. 
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expected order, the plots from computations at different discretization levels will be roughly coin- 
cident. Thus, for the geometric variables, where we expect second order convergence, the Z2-iiorms 
of the independent residuals were rescaled in the following way: 

||/'(t)||2^4('-5)||/'(t)||2, (5.10) 

where I refers to the level of discretization: 5, 6 or 7 in this case. The independent residuals for 
the scalar field quantities were rescaled in similar fashion. However, since we expect first-order 
convergence in this instance, the scaling factor is modified accordingly: 

||/'(t)||2^2('-5)||/'(i)||2. (5.11) 

We also observe that, unless otherwise specified, all of the subsequent plots of independent residuals 
in this chapter show rescaled values as well. 

Thus, Figs. 15.31 and 15.41 provide good evidence that the rescaled independent residuals for all 
of the geometric variables are converging as 0{h?), as expected. We also note that the functional 
form of I^{t) for the metric variables is roughly resolution-independent, so that we apparently do 
have 

lim = hit) and not lim = hit, h) . (5.12) 

Similar remarks can be made concerning the independent residuals for the variables <f>i, 4>2, Hi, 
and 112, except that the convergence of the /'' (t) in this case is linear in h. This is to be expected 
since, as also discussed in Chap. 21 the independent discretization adopted for approximate time 
derivatives was a first-order forward difference. 

Fig. 15.61 shows the ADM mass, Madm, and the Noethcr charge. Qjv, as a function of time for 
levels 5, 6 and 7. As the resolution increases, MABuit) and QNit) tend to constant functions, 
indicating that the continuum conservation laws are being recovered as the mesh spacing tends 
to 00 In order to better examine the rate of "convergence to conservation" of these values, 
Fig. 15.71 plots deviations of the ADM mass MADM(i) and the Noether charge QNit) from their 
initial values, A/adm(0) and Q^iO), as a function of time. Additionally, parallelling what we did 
for the independent residuals, the deviations computed at different levels of discretization have 
been rescaled assuming an 0(/i^) convergence rate: 

AA/iDM(i) = 4'-5 (Mi^^it) - A/iDM(0)) , (5.13) 
AQ^(t)=4'-5(Q^(t)-g^(0)), (5.14) 

for Z = 5, 6 and 7. Again, the fact that the plots of the rescaled deviations are nearly coincident 
provides strong evidence of second order convergence (to conservation, as a function of time) of 
both Qjv and Madm- 

Before proceeding to describe our next set of numerical experiments, we make a brief digression 
to emphasize a key assumption that underpins the efficacy of finite differencing. To keep the 
discussion simple, we consider the case of a scalar function of one independent variable, m(cc), and 
assume that we have discretized the continuum domain with a uniform grid with mesh spacing 
h. Then when any differential operator, L, acting on u is approximated with a finite difference 
operator, i'', the error in the approximation will be of the form 

L [uix)] - L'' [uix)] = hPEp [uix)] + ■■■ (5.15) 

where the positive integer p is the order of the approximation, and Ep is a differential operator 
of higher (differential) degree than L. For example, the usual 0(ft.^) centred approximation to 



40 reemphasize, however, that we know of no proof that A/admC*) should be conserved for other than spherically 
symmetric cases. 
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du/dx = u' (x) can be written as 



Ui+i — Ui^i _ u{x + h) — u{x — h) 



2h 2h 



(5.16) 



where ^ e [x ~ h,x + h] and h? /12u"' {^) is the error in the approximation. Note, however, that 
this result is vahd only if u"'{x) exists and is continuous on the [x — h^x + h]. 

This simple example highlights the importance of smoothness of both the continuum and dis- 
crete solutions in the context of finite differencing. Specifically, for the discrete case, "smooth" 
naturally means "smooth on the scale of the mesh" , and when grids that have h comparable to 
the scale of variation of the solution unknowns are used, one cannot expect "good" results from 
convergence tests, including independent residual evaluation. The reader should keep this point 
in mind, especially in later sections of this chapter where the coarsest mesh spacings used are 
comparable to the scales on which the solution is changing. Moreover, as we will see in the next 
section, (discrete) non-smoothness related to issues such as the treatment of boundary conditions 
can also easily spoil the convergence of our finite difference scheme. 
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t = 4.^ t = 6.0 t = 7.5 




t = 9.Q t = 10.& t = 12.B 




Figure 5.2: Time Evolution of a Generic Gaussian Profile. This figure displays a time-series of 
z = 2 cuts of the scalar field modulus \4>it, x, y, z)\ from a short evolution (roughly one dynamical 
time) of the initial configuration defined by Eg. 15.91 Here the value of 0o appearing in (|5.9p is 0.05, 
the range of the plotted data is < \(f>{t,x,y, z)\ < 0.12, and the solution was computed using 
a grid with shape = [129,129,129] (level 7). Note that the solution oscillates, but is apparently 
gravitationally bound, as there is no evidence for dispersal of the scalar field 
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Figure 5.3: Left panels: plots of the convergence factor Q^{t), as a function of time, t, for the 
geometric variables a, ip and /3^, using data from the calculation described in the caption of Fig. [5?^ 
The convergence factor remains close to the value 4 that is expected for our second-order scheme. 
Note that /3^(0) = at all resolutions, and we have thus defined Q^:c(0) = 0/0 = 0. Right panels: 
I2 norms of the rescaled independent residuals (see Ea. l5.10|) . |l/(i)||2, for the same set of variables. 
As the level of refinement increases, the plots of the rescaled I(t) become increasingly coincident, 
providing strong evidence of convergence of our code to the continuum solution. 
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Figure 5.4: Left panels: Plots of the convergence factor Q^{t) for and (t>i, using data 

from the calculation described in the caption of Fig. 15.21 Again, all convergence factors are close 
to the expected value of 4. Right panels: I2 norms of scaled independent residuals, ||/(t)||2, for 
the same set of variables, and where we have defined Q^y{Q) — 0/0 = and (0) = 0/0 = 0. 
Scaling factors for the two shift vector components and the scalar field arc given by (|5.10p and 
(|5.11|) respectively. We see clear evidence for the convergence of the independent residuals at their 
expected rates {0{h?) for and /3^, 0{h) for <^i). 
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Figure 5.5: Left panels: Plots of the convergence factor Q^ {t) as a function of time for the scalar 
field variables 02, Hi and 112, using data from the calculation described in the caption of Fig. 15.21 
Right panels: I2 norms of scaled independent residuals, |j/(i)||2, for the same set of variables. The 
residuals were rescaled using (|5.1ip and clearly are converging as 0(/i), as expected. 
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Figure 5.6: ADM mass Madm(0 and Nocthcr charge QN{t) from the calculation described in 
the caption of Fig. 15.21 Both plots indicate a trend to conservation of the respective quantity, with 
a convergence rate of 0{h?), as clearly illustrated in Fig. 15.71 below. 




Figure 5.7: Rcscaled deviations in ADM mass and Noether charge, as functions of time, t, from the 
calculation described in the caption of Fig. [521 The deviations were rescaled using (|5.13p and (|5.14p . 
and the near-coincidence of the curves is strong evidence of "convergence to conservation" . 
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5.3 Static Spherically Symmetric Initial Data - One Star 

In the previous section we used a generic type of initial data to establish the correctness and 
convergence of our numerical code. The results of our convergence tests (including those of the 
conserved quantities), and independent residual evaluation provide strong evidence that the dis- 
crete equations of motion are consistent, have been implemented correctly, and that the numerical 
solution obtained converges to the continuum solution. In fact, we consider the generic initial data 
test to be sufficiently comprehensive to establish the overall validity of the code. Nevertheless, in 
this section and the next we continue with code tests, albeit with slightly different goals. 

If there are no problems with the discrete equations of motion, or with convergence of sufficiently 
smooth solutions, then any deficiencies identified in the numerical results can likely be traced to 
characteristics of the initial data, or the resolution(s) used in the calculations. With this in mind, 
there were two key motivations for the tests performed in this section. 

First, it is natural to use the spherically-symmetric static configurations that describe single 
boson stars to simultaneously test 1) the evolution code, and 2) our solutions of the ODEs that 
determine the static solutions. Recall from Sec. 13.41 that these solutions are computed from the 
following ansatz for the complex scalar field: 

(l){t,R)^MR)e-"^\ (5-17) 

where R is the areal radius. It is important to note that even though the stars are characterized 
by a time independent gravitational field, r) is still time dependent due to the e~'"* factor in 
the above expression. We should thus expect that the metric variables computed from such initial 
data should exhibit only approximate time independence, but that the time independence should 
become more exact as the discretization scale, /i, tends to 0. 

Second, as also discussed in Chap. |3l the boson stars computed using the above ansatz form 
a one-parameter family, where 4>o{0) is a convenient choice for the family parameter. Different 
members of this family have different ADM masses, as well as different overall strengths of the 
gravitational field (as quantified, for example, by max^j 2m(i?)/i?). In this context it seems rea- 
sonable to expect that for fixed discretization parameters stronger gravitational fields should lead 
to larger numerical errors. 

We thus consider the numerical evolution of two distinct boson stars using our code. Both 
are on the stable branch (see Fig. 13. 2|) and have central scalar field values 0o = </'o(0) = 0.03 
and 00 = '/'o(O) = 0.06, respectively. Here, and in the following, we adopt a notation such that 
'^(poio) denotes the boson star having a central field modulus 4'o{0). With this nomenclature, the 
stars that we will study are S'0.03 and 6*0.06 ■ 60.03 is less massive, and thus less compact than 
So. 06: as can be inferred from Fig. 13.11 To evolve each of these configurations, we choose a set 
of grid parameters that is slightly different than that adopted in the previous section: namely 
bbox = [—15, 15, —15, 15, —15, 15], with the coarsest and finest of the three grids used given by 
shape = [33, 33, 33] and shape = [129, 129, 129], respectively. 

Figs. 15.8] and 15.91 show some snapshots from the time evolution of 60.03 and 6o.o6j respectively, 
as computed on the finest grid. At least to the naked eye these figures seem to indicate that that 
computed geometries are time independent in both cases. It is thus tempting to conclude that with 
respect to the first purpose of this particular experiment, the code has passed the test. 

However, as Figs. 15.101 15. Ill and l5. 121 show, the measured convergence rates for 60.03 quantities 
are significantly poorer than those from the 60. 06 calculation. This runs counter to the expectation 
that stronger gravitational fields and more compact scalar field configurations should lead to higher 
discretization error for a fixed mesh size. As argued above, we are then led to suspect that the 
convergence problem is related to the nature of the specific data that defines 60.03 • 

The defect in the 60.03 computations, relative to the 60. 06 results, is even more apparent when 
one examines the rescaled independent residuals. These are shown in Figs. I5.13[ 15.141 and 15.151 
which plot ||J(t)||2 for all dynamical variables and at all three resolutions used, and where the 
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rcscaling has been done using (|5.10p and (|5.1ip as appropriate. In each panel, the left and right 
plots correspond to the So.oe and 50.03 calculations respectively. Particularly worrisome is the 
fact that for S'0.03 the independent residuals for the shift vector components and Hi are actually 
increasing as the resolution decreases (this may not be immediately obvious due to the rescaling, 
but is in fact the case). On the other hand, consistent with the measured values of Q'^{t), all 
independent residuals for So.oe — with the exception of In^ and I^i — are converging as expected. Ell 

Finally, although Fig. 15.161 provides some evidence that there is convergence to conservation of 
mass and Noether charge for both sets of calculations, it is clear that the rate of convergence for 
mass conservation in the 5o.o3 is indeterminate at best, and that even for the 6*0.06 computations 
it is not definitively second order, as we can see from Fig. 15.171 

We have already mentioned that these results run counter to our intuition that numerical 
calculations of 5*0.067 with its stronger gravitational fields and steeper scalar field gradients, should 
be less accurate than those of S'0.03, for any given resolution. Moreover, since So. 03 *s less centrally 
condensed, the observed lack of convergence cannot be due to any inherent non-smoothness of the 
solution relative to the scales of the grids being used. What has been omitted from the analysis 
thus far is consideration of the boundary conditions. As discussed in Sec. 12.4.51 Dirichlet boundary 
conditions were used for all of the calculations reported in this thesis: this means that the outer 
boundary values for all dynamical variables remain unchanged relative to their initial values as the 
evolution proceeds. As we argued in that section, Dirichlet conditions can provide a reasonable 
approximation to the true boundary conditions — which are to be imposed at spatial infinity — so 
long as the boundaries of the computational domain are sufficiently far removed from the region 
containing matter. This leads us to suspect that for So. 03 the computational domain defined by 
bbox = [—15, 15, —15, 15, —15, 15] is simply not large enough. 

Indeed, a closer examination of the t = 3.0 frame of Fig. 15. 8| as shown in Fig. 15.181 reveals 
a discontinuous wave of scalar matter that originates at the boundaries of the numerical domain 
and then propagates inwards. The jump in the solution is estimated to be of the order of 1% of 
the central value, (j>Q. Clearly, this relatively small, but spurious, boundary effect has a dramatic 
impact on the measured convergence of the solution. As emphasized at the end of the previous 
section, when computing convergence factors and independent residuals, we always assume that 
the continuum solution is smooth, and that our discrete approximations are similarly smooth (on 
the scale of the mesh). Here we see vividly what can happen to calculated measures of convergence 
when this assumption is violated. 

The observed propagating discontinuity in the So. 03 calculation can be immediately connected 
to the fact that the star has a relatively long tail which extends beyond r = 15. This in turn means 
that there is significant dynamical behaviour in the scalar field variables near the boundaries of the 
computational domain — in particular at the next-to extremal grid points. Since Dirichlet conditions 
are used, the time variation in the function values at these grid points generates the discontinuity 
which then travels inwards, eventually contaminating the entire solution. 

For So. 06 this effect is much less pronounced, as can be seen from Fig. 15.191 Thus, although there 
is without doubt some level of discontinuous behaviour at and near the boundaries in this case, 
the resulting impact on the solution is well below the level of truncation error at the resolutions 
used. In other words, we can consider So.oe as providing another example of an initial dataset that 
is "sufficiently smooth" , given the computational domain, discretization parameters and boundary 
conditions. 

To conclude this section we return to the issue of the anomalous behaviour of the independent 
residuals, /^i, that are associated with the shift vector components (see Figs. 15.131 and 15. 14|) . Here 
we argue that the interpretation of the measured values of /^i requires special attention for the 
case of static initial data. 

^'^We do not yet understand the difference in the behaviour of 7ni relative to the independent residuals for the 
other matter variables, but note that there is also a large "glitch" in Qui at early times (see Fig. l5.12)l that is almost 
certainly related. 
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Figure 5.8: Snapshots of the time evolution of \(j){t,x,y,0)\ for the static boson star. 5*0.03, as 
computed on the finest grid (shape = [129,129,129]). At least to the naked eye, the evolution 
appears time independent. 
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Figure 5.9: Snapshots of the time evolution of \(f>{t,x,y,0)\ for the static boson star, So.oe, as 
computed on the finest grid (shape = [129, 129, 129]). As for the data displayed in Fig. 15.81 the 
evolution appears time independent. 
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Figure 5.10: Convergence factors Q^{t) of a and -0 for evolutions of the two static configurations 
(solid black curves) and S'o.oe (blue dashed curves). The superior convergence of the S'o.oe 
data at the resolutions used (shape = [33,33,33], [65,65,65] and [129,129.129]) is apparent. 




Figure 5.11: Convergence factors Q^{t) of (3^ for evolutions of the two static configurations 5o.o3 
(sofid black curves) and 5o.o6 (blue dashed curves). As in Fig. 15.101 the superior convergence of 
the 5*0.06 is clear. 
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Figure 5.12: Convergence factors Q^{t) of (pA and Ha for evolutions of the two static configu- 
rations 5*0.03 (solid black curves) and 5*0.06 (blue dashed curves). As in the previous two figures, 
better convergence is observed in the S*o.o6 data. There is, however, a noticeable "glitch" in the 
5*0.06 results for Qm {t) in the interval 1 < t < 3. Although we do not completely understand this 
behaviour at the current time, we suspect that spurious reflections from the boundaries may be to 
blame. 



115 



t 



5 10 5 10 




5 10 5 10 



t 



Figure 5.13: Rcscalcd independent residuals, ||/(t)||2, for a, and from evolutions of the two 
static configurations 6*0.03 (left panels) and 6*0.06 (right panels). As previously, the residuals were 
rescaled using (|5.10p so that coincidence of curves computed at different levels of discretization 
corresponds to convergence of the residuals at the expected 0{h?) rate. Apart from the residuals 
associated with (3^ (as well as fi^ and /3^, see Fig. 15.14]) . there is strong evidence that I{t) is 0{h?) 
for the geometric quantities. However, as discussed in the text, spurious boundary effects adversely 
impact the convergence of the independent residuals for the 6*0.03 case. 
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Figure 5.14: Rcscalcd independent residuals, ||/(t)||2, for and (pi from evolutions of 

the two static configurations 50.03 (left panels) and S'o.oe (right panels). Residuals were rescaled 
using (|5.10p for and and (|5.1ip for (pi. Although there is clear evidence for the expected 
0{h) convergence of I(j>i{t) for both calculations, for the case of the shift vector components (and 
as in the previous figure), the independent residuals are clearly not 0{h^) quantities. However, as 
discussed in the text, the continuum static solutions have /3*(t, x, y, z) = 0, and Fig. 15.201 provides 
clear evidence that for both calculations, all components of the shift vector do converge to as 
0{h?). Combined with the results from the test that used generic initial data (Sec. 15. 2p . this 
suggests that the anomalous behaviour of I pi (t) can be traced to the fact that the /3* vanish in the 
continuum limit, and, possibly to boundary effects. However, additional investigation is needed to 
provide a completely satisfactory explanation of the observed results. 
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Figure 5.15: Rcscalcd independent residuals, ||/(i)||2, for 02, Hi and 112 from evolutions of 
the two static configurations S'0.03 (left panels) and S0.06 (right panels). Residuals were rescaled 
using ()5.11|) for 01. Except for Ini{t), the expected 0{h) convergence of the residuals is observed 
for the 5*0.06 data. As previously noted in the caption of Fig. 15.131 and as discussed in detail in 
the text, boundary effects produce a deterioration of the convergence of the S'0.03 residuals. We 
suspect the same effect is at play in the case of lu^ (t) from the S'o.oe calculations. 
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Figure 5.16: Plots of the ADM mass A/adm(0 and Nocthcr charge QN{t) for 5*0.06 (left) and 
So. 03 (right), and for the three resolutions used in the calculations. "Convergence to conservation" 
is observed in both quantities for S'o.o6i but not for S'o.oa- 
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Figure 5.17: Plots of the rescaled deviations of the ADM mass MxDM{t) and Noether charge 
QN{t) for S'o.06 (left) and 50.03 (right), and for the three resolutions used in the calculations. 0{h?) 
"convergence to conservation" is evident for the 5*0.06 data, while for 5o.o3, the convergence rate is 
indeterminate at best. 




Figure 5.18: Magnified view of \<f>{3.0,x,y,0)\ (i.e. t = 3.0) from tlie 6*0.03 evolution. Tlie jump 
in the solution is estimated to be of the order of 1% of (po = \4>{0, 0, 0, 0)|. 




Figure 5.19: Magnified view of 10(3.0, x, y, 0)| from the 50.06 evolution. Here it virtually impos- 
sible to visually detect the discontinuity in the scalar field near the computational boundaries. 
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As discussed in Sec. I3.4[ a static spherically symmetric spacetime which has a time coordinate 
adapted to the time symmetry — as ours is — implies that f3{t, cc*) =0. Moreover, the finite difference 
approximations we use for the elliptic equations that govern , in combination with the multigrid 
algorithm that we use to solve the resulting algebraic equations, guarantees that [/3*]^fc = satisfies 
the discretized eUiptic equations, provided that [J*]^^ = 0. Here the [J*]^,; are the grid values 
(at discrete time <") of the components of the 3-momentum, as defined by (|2.177p . Now, at the 
initial time t = = we do have [J'lijj. = 0, so that [I3^]ljk = as well. However, the subsequent 
evolution on the Cartesian grid (xi,yj,Zk) generates non-sphericities in the scalar field variables, 
which leads to non-zero values for [J^]'2jk for any t" > 0. This in turn results in [/3*]^7i. 7^ for any 
discrete time 7^ 0. This fact is clearly illustrated in the plots on the left hand side of Fig. 15.201 
which show ||/3'(i)||2 from the 5*0.06 computations performed at discretizations levels 5, 6 and 7. 
At any given resolution there is clearly approximately linear growth in the norms of all of the 
shift vector components. However, the plots on the right hand side of the figure — which display 
values of the norms that have been rescaled by factors 4'~^ — show that all of the components are 
converging to as 0{h?). Thus, although we do not yet understand why the independent residuals 
for the shift components fail to scale as expected for static data, it seems clear that the failure is 
not related to a convergence problem of the themselves. 
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Figure 5.20: Left panels: Spatial ^2-nornis of the shift vector components, ||/3^(i)||2, |l/3^(t)|l2 and 
||/3^(t)||2 from the 5o.o6 calculation. Right panels: Values of these I2 norms rescaled using (|5.10p . 
These plots provide strong evidence that despite the issues with the independent residuals Ipi (t) 
displayed in previous figures, and discussed in the text, all of the shift vector components are 
converging to the continuum limit, x, y, z) = 0, as 0{h^). 
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5.4 Single Boosted Boson Star 

In the previous section we tested our code using spherically-symmetric initial datasets representing 
single boson stars. Modulo the problems identified when using stars not sufficiently compact 
with respect to the size of the computational domain, we observed satisfactory convergence of 
the numerical results to static continuum solutions, which in addition to providing a further test 
of the PDE code, also established that our procedure for computing the boson star initial data 
was working. However, there is one significant piece of our overall process for setting up initial 
data for boson stars that has not yet been tested. To evolve configurations describing stars with 
non-zero initial velocities — as is done in the next two sections — then once a spherically symmetric 
solution for a star has been generated and interpolated to the 3D numerical domain (Sec. 13. 6p . 
an (approximate) Lorentz boost is applied (Sec. I3.7p . This section thus reports the results of the 
same battery of tests used in the previous section, but now applied to the special case of a boosted 
spherically symmetric solution. 

We perform the tests on the star So.oe and maintain the same grid parameters employed in 
the previous section: bbox = [—15, 15, —15, 15, —15, 15], using three levels of resolution ranging 
from shape = [33, 33, 33] to shape = [129, 129, 129]. The star is boosted in the a;-direction with a 
boost parameter = —0.1. Fig. 15.21] displavs some snapshots of ]c/)(i, cc, y, 0)] from an evolution 
computed on the finest grid. Careful study of the sequence shows that the star is slowly (relative to 
the time scale of the sequence) moving across the computational domain from right to left (i.e. in 
the —X direction). A rough numerical estimate of the rate of change of the star's coordinate position 
with respect to coordinate time yields Ax/ At « —0.1, consistent with expectations. 

Convergence factors, Q^{t), and independent residual norms, ||/(t)]|2, are shown in the left and 
right panels, respectively, of Figs. 15.221 15.231 and 15.241 Once again, these results provide strong 
evidence that the numerical solution is converging to a continuum solution of the PDEs governing 
our model, and at the expected 0{h?) rate. Additionally, Fig. l5.25l and Fig. 15.261 indicate that both 
the ADM mass and the Noether charge are being conserved as ft, — > 0, and that the deviations from 
conservation are also 0{h^). 

We hope that the results from the extensive and comprehensive set of tests that we have 
described thus far in this chapter will have convinced the reader of the following: 

• That we have consistently discretized the equations of motion for our model to 0{h?) in both 
space and time. 

• That our implementation of the resulting finite difference approximation is correct, and pro- 
duces convergent (and thus, implicitly, stable) results. 

• That our procedures for computing and boosting single boson stars are also correct. 

• That convergence testing will reveal situations where the numerical solution has developed 
significant non-smoothness on the scale of the mesh. 

We now proceed to a discussion of the key numerical calculations in this thesis: those that 
involve two boson stars in interaction with one another. 



125 




Figure 5.21: Time evolution of a single 6*0.06 star which has been boosted in the x-direetion 
with a velocity parameter, Vx = —0.1 (see Sec. 13.71 for a full discussion of the algorithm used 
to determine the boosted data). Plotted are snapshots of \(j){t, x,y,0)\ for t = 0.0, 6.0 and 12.8. 
Careful inspection of these frames reveals that the star is slowly moving in the —x direction. A 
rough numerical estimate of the rate of change of the star's coordinate position with respect to 
coordinate time yields Ax/ At « —0.1, consistent with expectations. 
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Figure 5.22: Q'^{t) (left) and rescaled ||/(i)||2 (right) for a, ip and Z?"^ from the boosted S'o.oe 
experiment. Values of the independent residuals norms ||/(i)||2 have been rescaled using (|5.10p . 
These plots provide strong evidence of 0{h^) convergence for all three of the geometric variables. 
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Figure 5.23: Q''(t) (left) and rescaled \\I{t)\\2 (right) for /3?', and (j>i from the boosted 50.06 
calculations. Values of the independent residual norms ||/(t)||2 have been rescaled using (|5.10p for 
Ipy{t) and Ip'^it) and (|5.1ip for Icf,i{t). These plots provide strong evidence of 0(h'^) convergence 
for aU ofpy, and (/>i. 
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Figure 5.24: Q'^{t) (left), and rescaled |l/(i)ll2 (right) for and 0i from the boosted S'o.oe 

calculations. Values of the independent residual norms ||/(t)||2 have been rescaled using (|5.1ip . 
Here we observe some irregularities in the norms of In^ (t) and /iia (t) that we suspect may again 
be due to our use of Dirichlet conditions on a computational domain whose extent is comparable 
to the size of the star. Nonetheless, these plots still provide strong evidence for 0{h^) convergence 
of the scalar field variables. 
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Figure 5.25: ADM mass A/adm(0 (left) and Nocthcr charge Quit) (right) for the boosted 5'o.o6 
calculation. These plots suggest that both the ADM mass and Nocthcr charge arc conserved as 
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Figure 5.26: Rescaled deviations of ADM mass (left) and Noether charge (right), relative to 
their initial values, for the boosted S'o.oe calculation. The deviations were rescaled using (|5.13p and 
(|5.14p . Although it seems evident that the computed Noether charge is converging to a conserved 
value as 0{h^), the convergence rate for Madm cannot be determined with certainty using the 
available data. 
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5.5 Head-on Collision of Two Boson Stars 

In this section we report the resuhs from an numerical experiment describing the head-on colhsion 
of two boson stars which are initially boosted towards one another. We note that head-on collisions 
of self-gravitating boson stars have been investigated previously by other researchers. The prior 
studies include work by: Choi |173[ 1174] who implemented an axisymmctric Newtonian code, |fl 
Balakrishna [6], using a 3D fully relativistic code, Lai |128| who used a fully general relativistic, 
but axisymmetric code, Palenzuela et al^, using a 3D general- relativistic code, and most recently 
Choptuik and Pretorius |130] who used an axisymmetric general-relativistic code. One particularly 
interesting effect that Choi first demonstrated for the Newtonian case is so-called "solitonic be- 
haviour" , wherein upon collision the stars interpenetrate and effectively pass through one another, 
emerging from the interaction relatively unscathed. Subsequently, this behaviour has also been 
seen in the general relativistic calculations reported in |128| and |130| . Here we show that the 
solitonic effect is also present within the context of our current model, which in some loose sense 
is an intermediary between the purely Newtonian and fully general relativistic calculations. 

The experiment was prepared as follows. First, we adopted a discrete domain defined by 
bbox = [—50, 50, —25, 25, —25, 25]. Note that this yields a computational volume with a.n x : y : z 
aspect ratio of 2 : 1:1, and that the coordinate extent in the y and z directions is roughly double 
that used in the calculations reported in previous sections. This last point is important since the 
star configuration that we used in the collision is relatively extended, so we needed to be careful 
to ensure that our calculations were not significantly affected by the boundary-discontinuity issue 
described in Sec. 15.31 

Next we generated initial data for the star S'0.02 (i-e. a star with central field modulus (po = 0.02) 
and the star S'o.oa- Both solutions were then interpolated onto the 3D numerical domain with centres 
at (25, 0, 0) and (—25, 0, 0), respectively. The stars were then given boosts in the x direction having 
equal magnitudes but opposite senses: v^. = —0.4 and v'^ = 0.4. This of course results in an initial 
configuration in which the stars are already approaching one another with a significant relative 
velocity. 

The computations performed in this section were made using three different grid resolutions. 
Let the grids be denoted by Gi, G2, and G3, where Gi is the coarsest and G3 is the finest mesh, 
the associated shape parameters are given by: 

Gi : shape^ = [129,65,65], (5.18) 
G2 : shape2 = [161,81,81], (5.19) 
G3 : shape, = [193,97,97]. (5.20) 

Fig. 15.271 shows the time evolution of \(j){t, x,y,0)\), < t < 125 as calculated on the grid G3, 
using the initial data described above. Significant direct interaction of the stars begins at i « 40, 
and hy t ~ 55 it is essentially impossible to identify two distinct objects. The period of strong 
interaction — during which "interference patterns" are clearly visible — persists until i « 100. The 
stars then emerge from the collision with their initial shapes roughly preserved, and continue to 
propagate until the end of the calculation. 

During the period of interaction, the scalar field modulus reaches a maximum value \(t>(t, x, y, z)\ 
~ 0.059. In addition, the stars come out of the collision with an estimated average coordinate 
velocity Ax/ At ^ 0.24. The rescaled I2 norms of the independent residual, Icj,i{t), for the scalar 
field component, 0i, are plotted in Fig. 15.281 They provide validation of the computation since 
they are converging at the expected 0{h) rate. 

Given the previous observations of the solitonic nature of head-on boson stars in fully general 
relativistic cases, the results of this experiment can be interpreted as providing qualitative evidence 



*^This required the solution of the Schrodinger equation for the complex scalar field, and the Poisson equation for 
the Newtonian gravitational field. 
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that the CFA is capturing at least some of the essential physics described by a solution of the full 
Einstein-Klein-Gordon equations. However, we must again note that Newtonian calculations also 
yield the same type of behaviour. Thus, without some sort of direct comparison with fully general 
relativistic results, our computations should be viewed as providing a relatively weak validation of 
the suitability of the CFA as a replacement for the Einstein equations in the treatment of head-on 
collisions of boson stars. 

Finally, we note that the initial conditions used in this section have y ^ —y and z — > — z 
symmetries. As we have mentioned previously, these were not exploited in this (or any other) 
calculation. However, in this case we have estimated how well the reflection symmetry y — > —y is 
maintained during the evolution. Results of this test are shown in Fig. 15.291 which suggest that, at 
any of the resolutions used, the maximum deviation from exact symmetry in a typical dynamical 
variable — 10| in this instance — was about 1 part in 10^. 
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Figure 5.27: Head-on Collision of Boson Stars: Solitonic Dynamics. This figure shows a scries 
of snapshots from the time evolution of |(/)(t, a;, y, 0)|, for < t < 125. The initial configuration 
describes two different boson stars, 6*0.02 ^i^d 6*0.03, centred at (25, 0, 0) and (—25, 0, 0), respectively, 
and boosted with velocity parameters v^. = —0.4 and = 0.4. At time t sa 40, the stars are 
interacting substantially, and by t = 55 it is virtually impossible to identify the individual stars. 
During this interaction epoch "interference patterns" that arc characteristic of boson star collisions 
of this type are evident. By t 100 the period of interaction has essentially concluded and 
distinct star-like configurations can again be identified. The stars emerge with their shape roughly 
preserved — thus exhibiting "solitonic" dynamics — and continue to propagate in the directions of 
their respective original boosts. 
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Figure 5.28: This figure shows the rescaled I2 norms of the independent residuals, ||J^j(t)||2, 
from the caleulation of the head-on collision of S'002 ^'^^ ^0 03- The plot focuses on the time 
interval 50 < ^ < 100, when the interaction between the stars is strongest. The blue, red and black 
curves correspond to runs using grids Gi, G2 and G3 (see text), which had mesh scales hi, /12 
and /13, respectively. The red and black values have been rescaled by factors of ft.i//i2 and /11//13 
respectively, and the near-coincidence of the curves shows that the independent residuals are 0{h), 
as expected. 
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Figure 5.29: Head-on Collision of Boson Stars: y — > —y Symmetry Assessment. This fig- 
ure plots a measure of the extent to which our code breaks the y ^ —y symmetry that is ex- 
pected in the continuum limit for the head-on collision of S}^^2 ^^'^ "^oos- Specifically, we define 
A0(<, x) = \(j){t, X, 5, 0)| — X, —5, 0)1 and then plot \\A(p{t, x)\\2, where the norm is taken over the 
remaining spatial coordinate, x. Blue, red and black curves correspond to calculations on Gi (coars- 
est resolution), G2 and G3 (finest resolution), respectively. The maximum value of |0(t,x, 5,0)| 
attained during the computation is about 10~^. We thus conclude that our code preserves the 
y — > — y symmetry to about a part in 10^ for this particular set of calculations. 
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5.6 Orbital Dynamics of Two Boson Stars 

In this section we summarize what we consider to be the most interesting and important calculations 
that we have performed with our code to date. We focus on dynamics that involves two identical 
boson stars — 150.02 — but now set initial data so that, at least initially, some form of orbital motion 
results. The stars continue to be centred in the z = plane, but at i = have centres on the y 
axis which are equidistant from the origin. We then give the stars initial boosts in the a; direction 
which arc of equal magnitude, v^, but of opposite sign. By adjusting the size of v^, we can produce 
evolutions that lead to one of three distinct end states, as already summarized in the introductory 
section of this chapter. Specifically, we have run calculations for = 0, 0.03, 0.05, 0.07, 0.08, 
0.09, 0.1 and 0.11. From this sequence of computations, and again noting that we used stars 
with (j)Q = 0.02, we have identified approximate ranges for Vx associated with the three types of 
behaviour: 

1. Long-lived orbital motion: 0.09 < v.^ < 0.11. 

2. Merger of the stars that results in a conjectured rotating and pulsating boson star: 0.07 < 
Vx < 0.09. 

3. Merger of the stars that leads to the conjectured formation of a black hole: < w^; < 0.07. 

Following a description of the computational setup used for our parameter survey (in Vx), we 
will proceed to discuss representative examples from each class in some detail. 

All the calculations documented in this section were performed on a computational domain 
defined by bbox = [—60, 60, —60, 60,-60, 60] , and all were made using four different grid resolutions. 
Denoting the grids Gi, G2, G3 and G4, where Gi is the coarsest mesh and G4 is the finest, the 
corresponding shape parameters are given by: 

(5.21) 
(5.22) 
(5.23) 
(5.24) 

As usual, there is a z — > —z symmetry in the computations, and the plots that are displayed were 
generated from z = cuts of the full 3D solutions. The two identical stars, 5o.q2 and 5g q2, are 
initially centred at = (0,20,0) and G^ = (0,-20,0), and given boosts in the x direction with 
v^. = —Vx and = Vx- We emphasize that Vx is being treated as a control parameter for our 
experiments: in particular, the only difference in parameter settings for the three cases described 
below is the value of Vx that is used. 
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5.6.1 Case 1: v,^, = 0.09 — Long Lived Orbital Motion 

Results from this calculation are shown in Figs. I5.30H5.35] Fig. 15.301 displavs a series of snapshots 
of \(j){t, X, y, 0)1 for < t < 4211. During this period of coordinate time the stars orbit one another 
about 2;j times. The orbit is slightly eccentric, as can be seen from Fig. 15.311 which shows the 
orbital trajectory of the centre of each star (defined to be the position of the local maximum 
of |c/)(t, cc, y, 0)1). We note, however, that we have not yet made any attempt to estimate the 
eccentricity. Detailed examination of the trajectories suggests that the orbit is also precessing. Since 
orbital precession is a purely general relativistic effect (assuming that the stars remain spherical in 
their respective rest-frames) this qualitative result provides additional encouraging evidence that 
the CFA is capturing some of the key physical effects that one would see in a solution of the full 
Einstein equations. It would therefore be very interesting to attempt to quantify the observed 
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rate of precession and compare it to the value obtained (or estimated) from a general relativistic 
calculation. This, however, is something that remains to be done. 

Plots of the I2 norms of two of the independent residuals — and Ia{t) — from the compu- 
tations performed on the four different grids are shown in Figs. I5.32H5.331 Here, and for all of the 
remaining figures in this chapter, the colours green, blue, red and black are used for quantities com- 
puted on grids Gi , G2 , G3 and G4 respectively. Additionally, and in contrast to the corresponding 
plots in Sees. 15. 21 - 15. 4l we have not scaled the independent residuals in this section or the next, as the 
resulting graphs are difficult to interpret (exceptions are Fig. 15.331 and Fig. I5.35"|) . Nonetheless, a 
detailed quantitative examination of the residuals, such as that provided by Fig. l5.33l and Fig. l5.35( 
reveals that the independent residuals are converging at the rates expected. 

5.6.2 Case 2: v.j, = 0.07 — Formation of Pulsating & Rotating Boson Star 

Results from this calculation are summarized in Figs. l5.36lH5.38l Fig. 15.361 shows a series of surface 
plots of \4>{t, X, y, 0)1 for < t < 2089, while Fig. 15.371 shows trajectories of the stars prior to their 
merger. In this case orbital motion persists for a little less than one half of a full rotation before the 
stars graze one another. This is followed by a rapid "plunge" phase which leads to the formation of 
what we conjecture to be essentially a single spinning and pulsing boson star, with a central field 
modulus 00 — 0.05. 

As usual, we monitor independent residuals to ensure that our calculations are reliable, and 
Fig. l5.38l shows the I2 norm of Ia{t). We first note that the plot shows that la is tending to as the 
mesh spacing decreases. However, it is also clear from the figure that the convergence deteriorates 
at later times, especially once the stars have merged. Indeed, the apparent divergence of Ia{t) on 
Gi (green) and G2 (blue) for t > 500 is a clear indication that the numerical solutions computed 
on those grids are not trustworthy once the collision has occurred. In addition, the merger results 
in the ejection (radiation) of a significant amount of scalar matter. Some of this ejecta hits the 
outer computational boundary and, due to our use of Dirichlet boundary conditions, is largely 
refiected back into the solution domain. These spurious reflections eventually contaminate the 
entire calculation, so that even though the solution may still exhibit convergence as measured via 
independent residuals, the physical interpretation at late times is dubious at best. 

5.6.3 Case 3: v,j. = 0.05 — Formation of a Black Hole 

Results from our last numerical experiment are shown in Figs. I5.39H5.4T1 here the initial boost 
parameter is Vx ~ 0.05. As usual, we start with surface plots of |0(i, x, y, 0)|, this time on the 
interval < i < 411. Following a very brief period of orbital motion, the two stars quickly plunge 
towards one another and merge, as can clearly be seen in the trajectory plots shown in Fig. 15.401 
In this case we believe that the evolution results in the formation of a single black hole. This 
hypothesis could best be tested by looking for marginally trapped surfaces |120| in our data — that 
is, surfaces with S*^ topology for which the divergence of outgoing null geodesies emanating from 
the surface vanishes. If such a surface was located, then, assuming cosmic censorship |120j holds, 
we could infer that the data described is a black hole. However, at the current time we have 
not implemented this approach. Instead, our conjecture is based on the behaviour of the metric 
functions in the central interaction region. For example, the lapse function attains extremely small 
values in that part of the solution domain, effectively freezing the evolution there. This "collapse 
of the lapse" is a very well known feature of maximal slicing and is usually correlated with the 
formation of a black hole. 

However, we must emphasize that various indicators — such as the plot of the independent 
residuals for the lapse function shown in Fig. 15.411 — strongly suggest that the results for this specific 
choice of Vx are not trustworthy for times t > 380, even on the finest grid used. At later times 
the matter is highly centrally condensed, and, assuming that collapse to a black hole is occurring, 
would become increasingly so as the evolution proceeded. Thus, only a substantial increase in 
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resolution in the interaction region would allow us to provide a definitive answer concerning the 
end state of members of this class of initial data. 

Finally, since code "crashes" — due to floating point overflows, for example — are rather frequent 
occurrences in numerical relativity calculations, we should point out that our code did not "crash" 
at late times for this configuration (or for any other simulation reported in this thesis, for that 
matter). Rather, we simply stopped the evolution at a coordinate time when, even on the finest 
grid, the solution was obviously very poorly resolved. 
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Figure 5.30: This figure shows a series of snapshots of \(f>{t,x,y,0)\ for < t < 4211 from a 
calculation describing long lived orbital motion of two identical boson stars. The two stars, Sq q2 
and 5*0 02j ^re initially centred at = (0, 20, 0) and = (0, —20, 0) and boosted with parameters 
Vj. ~ ~vi^^ ~ v^^ ~ 0.09. During this period of time that is displayed, the stars execute about 
2 J orbits, and the maximum value of \(j>{t,x,y,Q)\ remains approximately constant during the 
evolution. 
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Figure 5.31: Long lived orbital motion: stellar trajectories. This figure shows trajectories 
of the two stars, 6*002 ^i^d <S'oo2j they orbit one another during the coordinate time interval 
< t < 4500. Each red or blue dot represents the local maximum of \4){t,x,y,0)\ at a particular 
instant of time. The orbit is slightly eccentric and is apparently precessing with time. In this and 
other figures of this type, in order to provide some sense of the size of the stars we have also plotted 
circles that approximately delimit the stellar surfaces. Specifically, the solid and dashed lines have 
radii Rgg (containing 99% of A/adm) and Rg^ (containing 95% of Madm), respectively. 
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Figure 5.32: Long Lived Orbital Motion: ||/0i(i)||2- This figure shows the ^2-iiorm of tlie in- 
dependent residuals /^^(t) for < < < 4500. During this interval of coordinate time the stars 
perform about 2^ orbits. Residuals plotted with green, blue, red and black dots were computed 
using grids Gi (coarsest resolution), G2, G3 and G4 (finest resolution), respectively, with specific 
grid parameters defined in the text. Note that these residuals have not been rescaled. The plots 
provide strong evidence of convergence of the independent residuals as /i — >■ 0. 
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Figure 5.33: Long Lived Orbital Motion: Rescaled ||/0i(i)||2- This figure shows the same inde- 
pendent residuals displayed in Fig. I5.32[ but on the shorter time interval < t < 20, and with 
scaling factors /11//12, /11//13, and hi/h^ multiplying the values computed on G2 (blue), G3 (red) 
and G4 (black), respectively. The expected 0{h) convergence of the residuals is apparent. 
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Figure 5.34: Long Lived Orbital Motion: ||/Q(i)||2- This figure shows the Z2-norm of the inde- 
pendent residuals Ia{t) for < ^ < 4500, and as computed on grids Gi (green, coarsest resolution), 
G2 (blue), G3 (red), and G4 (black, finest resolution). As with Fig. 15.321 these residuals have not 
been rescaled. Once more, the plots strongly suggest convergence of the independent residuals as 
h->-0. 



143 



3.4x10 



-5 _ 



3.2x10 



-5 _ 



a 



3x10 



-5 _ 




10 

t 



15 



20 



Figure 5.35: Long lived Orbital Motion: Rcscalcd ||/a(t)||2- This figure shows the same indepen- 
dent residuals displayed in Fig. 15.341 but on the shorter time interval < t < 20, and with scaling 
factors (/ii//i2)^, (hi/h^)^, and (ft.i//i4)^ multiplying the values computed on G2 (blue), G3 (red) 
and G4 (black), respectively. The 0{h?) convergence rate expected for all the residuals associated 
with the metric variables is apparent. 
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Figure 5.36: This figure displays a series of surface plots of \(f>{t, x, y,0)\ for < t < 2089, from a 
calculation in which two identical stars, Sq q2 and 5*0 q2, are initially centred at = (0, 20, 0) and 
= (0,-20,0), and boosted with parameters Vx = — wi^"* = wi^"* = 0.07. The stars subsequently 
execute somewhat less than half of an orbit before they graze one another. A quick "plunge" phase 
follows, leading to the formation of a (conjectured) pulsating and rotating boson star characterized 
by a central field modulus (fio — 0.05. 
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Figure 5.37: Formation of Pulsating and Rotating Boson Star: Trajectories. This figure shows 
the trajectories of the stars and Sq 02 for the computation described in the caption of Fig. 15. 361 
Each red or blue dot plots the position a local maximum of \(j){t, x, y, 0)| at a particular instant of 
time. Here the tracks clearly reveal a short period of orbital motion which is followed by a rapid 
"plunge" and merger of the stars. 
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Figure 5.38: Formation of Pulsating and Rotating Boson Star: ||/Q(t)||2- This figure shows 
the Z2-norm of the independent residual Ia{t) for the caleulation deseribed in Fig. 15.361 We note 
that the residuals have not been resealed in this case, but that the values eomputed on grids Gi 
(green, coarsest resolution), G2 (blue), G3 (red), and Ga (blaek, finest resolution) do seem to be 
converging to as /i 0. However, it also clear that there is a qualitative change in the nature 
of the independent residuals eomputed on Gi and G2 once the collision has occurred at i ~ 500. 
This is a good indication that the calculations on those relatively coarse meshes are not reliable 
for t > 500. 
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Figure 5.39: This figure displays a series of surface plots of \(l){t, x,y,0)\ for < t < 411, from 
a calculation in which two identical stars, S'Qg2 and >5'gQ2, are initially centred at = (0,20,0) 
and = (0, —20, 0), and boosted with parameters = —v^'' = v^'' = 0.05. As with all previous 
plots of this type, the specific results visualized were computed on the finest grid, G"^, which has 
shape = [113, 113, 113]. In this case, the stars rapidly approach each other, and, after a plunge and 
merger phase, apparently undergo gravitational collapse to form a single black hole. At t = 411 
the maximum value of \4>{t,x,y, z)\ (the height of the "spike" in the last frame) is ~ 0.06, and 
even larger values — |(/)(t, a;, y, z)| > 0.14 — are seen at later times. We note that as can be verified 
from Fig. 13. 21 such large values are associated with boson stars on the unstable branch. It thus seems 
plausible that the evolution will lead to black hole formation, but computations with considerably 
more resolution in the central, strongly interacting region would be required to test this hypothesis. 




- -0.05 
= 0.05 



-60 -40 -20 20 40 60 

X 



Figure 5.40: Formation of a Black Hole: Stellar Trajectories. This figure shows the trajectories 
of the stars Sq 02 and Sq q2 from the calculation described in the caption of Fig. 15.391 and during 
the coordinate time interval < t < 1500. Once more, each red and blue dot represents a local 
maximum of \4>(t, x, y, 0)| at a particular instant of time. The resulting tracks show a rapid plunge 
of the stars towards one another, resulting in a merger that we believe ends in black hole formation. 
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Figure 5.41: Formation of a Black Hole: ||/Q(i)||2- This figure shows the /2-norni of the indepen- 
dent residuals Ia{t) for the calculation described in the caption of Fig. 15.391 The residuals have 
not been rescaled. At early times, the residual values computed on grids Gi (green, coarsest reso- 
lution), G2 (blue), Gs (red), and G4 (black, finest resolution) once more appear to be converging 
to as ft, -> 0, However, for t > 400, when we suspect that gravitational collapse to a black hole 
would result if the finite difference resolution was high enough, we see an obvious deterioration in 
the convergence of the residuals computed on all grids. We interpret this as a clear indication that 
none of the computations are reliable for t > 400. 
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5.7 Discussions and Further Developments 

It is worth noting that the second and third experiments discussed in the previous section are 
qualitatively consistent with some of the results reported by Palenzuela et. al. in [10]. Those 
authors have implemented a fully general relativistic 3D code for the Einstein-Klein-Gordon system, 
and have also performed calculations in which the initial data contains two identical spherically 
symmetric boson stars that have been given equal but opposite boosts. One of the calculations 
discussed in [10] is quite similar in setup to our ~ 0.07 calculation described in Sec. 15.6.21 
in which the final state appears to be a pulsating and rotating boson star. The specific stars 
Palenzuela and collaborators used in that case each had an ADM mass, Madm = 0.5, while the 
ones used here (5o.o2) had A^adm — 0.475. They also positioned their stars so that, at least in 
coordinate space, they were closer to one another at the initial time than ours were. Defining d to 
be the coordinate separation of the centres of the stars, they took d = 32. On the other hand, we 
chose d = 40 so that our stars — which have i?99 ~ 17.25 — were essentially completely separated at 
t = Q. These differences in the initial configurations are relatively minor, so we feel that a general 
comparison of our results with theirs is not unreasonable. Specifically, they identified the end state 
of their evolution as a spinning "bar" configuration, accompanied by dispersal of scalar matter, 
and this at least roughly agrees with the interpretation of our end state as a spinning and pulsing 
star. In addition, the range of values of the boost parameter, that led to black hole formation 
in their calculations, namely < I'x < 0.04, is in rough agreement with our findings, where we 
conjecture that black hole formation occurs for Q <Vx < 0.07. 

Palenzuela et. al. also report a very interesting case in which the stars merge into a transient 
single lump of scalar field from which two configurations of approximately the same form as the 
initial stars eventually emerge, and propagate away from one another. We have not observed any 
behaviour of this type in the calculations we have done with our code thus far. 

Arguably, the most novel of the results presented here are those that describe long term orbital 
motion. To our knowledge, the calculations reported in Sec. 15.61 are the first in which more than 
two orbits of a binary boson star have been simulated. Furthermore, we have noted that there is 
strong evidence of orbital precession, which indicates that our model has captured some of the key 
physical effects predicted by general relativity. 

We note that one of the original motivations for this work was to see whether the compression 
effect reported by Wilson and Mathews [111 [ITl [TH] [111 EO] ET] [23l ES] [26] for the case of binary 
neutron star inspiral (and using the conformally flat approximation with maximal slicing) was 
present in our boson star analog of that setup. In this regard our current results are inconclusive. 
In one calculation, where the boost parameter was = 0.08, there did appear to be compression 
of the stars as they orbited. However, this is another case where significantly higher resolution 
is needed to provide a clear answer. Moreover, should something like the Wilson-Mathews effect 
be convincingly seen in our calculations, we would want to vary the scalar field self-interaction 
potential, J7(|(/>oP) to see what impact that had on the results. Here the key point is that the 
reported compression effect for the case of fluid stars depends on the equation of state that is 
adopted. 

We conclude this chapter with a few comments concerning future plans for improvements and 
extensions of our code. The most pressing issue is clearly that of getting adequate finite-difference 
resolution for those instances where the boson stars interact strongly, and especially when black 
holes appear to be forming. This is simply not possible given that our current code uses a single 
uniform mesh, and only runs on one processor. We therefore plan to parallelize the code but, 
perhaps more importantly, to incorporate adaptive mesh refinement (see, for example, the classic 
paper by Berger and Oliger |152j ). so that the increased resolution can be concentrated where it is 
needed. In fact, using the AMRD/PAMR software infrastructure developed by Pretorius |1 77111 78] . 
we have already developed a version of the code that should both run in parallel (with good 
scaling on up to lOO's of processors) and provide AMR. However, this implementation is still being 
debugged and tested. 
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A second component of our algorithm that obviously needs improvement is the treatment of the 
boundary conditions. As discussed in Sec. I2.4."5] a (formally) exact implementation of the physical 
boundary conditions could be achieved through compactification of the spatial domain. Although 
it is promising, this approach would require the development of a more sophisticated multigrid 
solver, capable of efficiently solving highly anisotropic equations, and doing so in parallel. This will 
not be an easy task. Given that we are incorporating AMR into our code — which should allow us 
to substantially extend the boundaries of the computational domain — we feel that it would be most 
fruitful at this time to implement the strategy described in Sec. 12.4.51 which uses mixed (Robin) 
conditions based on known falloff behaviour for the metric variables, and Sommerfeld conditions 
for the scalar field unknowns. 

Wc end by emphasizing that the experiments described above represent only a start of a thor- 
ough investigation of the multi-dimensional parameter space of boson star orbital dynamics. We 
hope that once we have implemented the improvements just discussed, we will be able to use our 
code to survey this space at significantly lower computational cost relative to fully general rela- 
tivistic calculations. Ideally, this survey would identify regions of parameter space that warrant 
more detailed investigation using codes that solve the full Einstein equations. 
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Chapter 6 
Conclusion 



The main new work presented in this thesis involved the implementation of a finite-difference code to 
solve a nonlinear system of elliptic-hyperbolic partial differential equations in 3 space dimensions 
plus time. The PDEs describe the evolution of a self-gravitating massive complex scalar field, 
where the gravitational interaction is given by the conformally flat approximation of the Einstein 
equations. This code was thoroughly tested to establish overall correctness of the implementation 
(in particular, consistency with the continuum PDEs), and was shown to generate results that 
converge to the continuum limit at the expected rate in the mesh spacing. The implementation 
was then used to study interactions between two boson stars. 

In simulations of head-on collisions, we observed the same type of solitonic behaviour that 
has been seen in previous computations using this type of matter |1731 11741 1128| llBOj . We also 
performed a preliminary survey of orbital dynamics of boson star binaries, finding evidence for 
three distinct end states: a long-lived, precessing elliptic orbit, merger of the two stars into a final, 
more compact boson star, and merger that we conjecture produces a black hole. These results are 
consistent with the findings of Palenzuela and collaborators [10] , who solved the full Einstein-scalar 
system. This concordance suggests that the CFA may be sufficiently accurate to capture at least 
some of the key features of general relativistic dynamics, even in fully 3D scenarios. 

The major weakness of our current code is its limited resolution. For many of the runs we have 
described, the resolution was sufficient for us to be confident that our modeling was capturing the 
key physics exhibited in the evolutions. However, in other instances, and especially in those cases 
where we suspect that black holes are forming, the calculations are not reliable at late integration 
times, and only a substantial increase in resolution will fix this flaw. As we have already discussed, 
we are currently working on a parallel, adaptive version of the code that should provide this fix. 

Determining conditions under which the CFA docs or does not provide a good approximation 
of general relativity is another research priority. Ultimately, this pursuit may even be able to shed 
some light on the elusive issue of identifying the true dynamical (radiative) degrees of freedom of 
Einstein's theory. 
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Appendix A 

BSIDPA — Boson Star Initial Data Function in 
Polar Areal Coordinates 



bsidpa is a freely available^ FORTRAN 77 subroutine that generates numerical solutions describ- 
ing static, spherically symmetric boson stars as described in Chap. [3] (the reader should refer to 
that chapter for details concerning notation, definitions of functions used below, etc. which will 
not be repeated here). The routine name is an acronym for Boson Star Initial Data in Polar- 
Areal coordinates, bsidpa has been written so that the user can specify a general polynomial 
self- interaction potential, ?7(0o(-R)), for the scalar field: 



where p„ is the degree of the interaction polynomial, and the coefficients pj, i = 1,2, . . .p„ are 
user-supplied. Note that p„ — 2, with pi = and P2 = 1, is the case considered in this thesis. 
In other words, the potential only has a mass term, which leads to static configurations that are 
sometimes called mini boson stars. As detailed in Chap. |31 for any given potential, the static 
boson stars form a one-parameter family, and it is convenient to use the central value of <?I>o('") — 
i.e. 00 (0) — as the family parameter. Wc also recall that the ODE system (|3.49|) " ()3.52p (hereafter 
often referred to as "the ODEs"), which bsidpa solves numerically using the Isoda integrator 
from ODEPACK [180| . constitutes an eigenvalue problem, with eigenvalue (eigenfrcquency) uj = 
Lo{(j)o(0)). Thus, another key input to bsidpa is the central field value, (/)o(0). If the user knows 
the corresponding eigenfrcquency, that can also be supplied. However, if lo is not known for the 
given value of 4>a{0), bsidpa will attempt to compute it using a bisection algorithm. The bisection 
is based on the observation that, generically, as i? — >■ cxd we have <f>o{R) — )■ oo for cjhi < ^, while 
4>q{R) — >■ — oo for ojio > w Efl- In this latter case, the user can either supply an initial bracket 
[whii'^io] such that ojhi < w < wio, or bsidpa can attempt to locate such a bracket automatically. 
A typical invocation of bsidpa is illustrated by the following fragment of FORTRAN 77 code: 



integer sht , n, pn, pt , dft, tail_type 

real*8 phiO, rmax, lsoda_tol, w_tol, dr, whi, wlo, wshoot, wresca 

real*8 a(n) , alpha(n) , phi(n), pp(n) , m(n) , zr(n), r(n) 

real*8 p(pn) 



call bs idpa ( a , alpha , phi , pp , m , zr , r , p , phiO , rmax , lsoda_t ol , w_t ol , dr , 
it whi , wlo , wshoot , wresca , sht , n , pn , pt , df t , tail_type ) 



''^bsidpa and supporting routines are maintained in the UBC numerical relativity group's ftp repository I179| . 
''^ Observe that in Chap. |3] we used a notation such that uj- = u)^^^ and u)j^ = uJi^. 
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The various arguments to the routine can be classified as being either inputs or outputs. Further, 
some input arguments are required, while others are "optional" , in the sense that a flag can be set 
so that they are assigned default values. 
Input Arguments 

The following input arguments must be supplied: 

phiO: The central value of the scalar field, (/)o(0) 

rmax : The range of the integration: i.e. the ODEs are integrated from R = to R = rmax. 
pt : The potential type. Currently, the only valid value for pt is 

• pt = 1 : Polynomial potential as given by (jA.ip . 

p(pn) : real*8 vector of length pn defining the coefficients Pi of the polynomial potential. 

n : Number of grid points in the interval < i? < rmax at which to compute the solution 
of the ODEs. Specifically, the solution will be computed at Rj = (j — l)Aii , j = 1, 2 . . . , n, 
where An = rmax/(n — 1). 

sht : Flag that controls how the eigenfrequency, wshoot, is to be determined by bsidpa. 
Valid values are as follows: 

• sht ~ : User supplies wshoot. 

• sht = 1 : Routine uses a shooting technique and bisection to determine wshoot to 
within the user-specified tolerance w_tol (see below). Also see documentation of dft 
below which controls whether the routine or user is responsible for determining an initial 
bracket for wshoot. 

wshoot: Solution eigenfrequency as defined above. Note: for sht = this is an input 
argument, while for sht = 1 it is an output argument. 

dft : Flag that controls use of default values for "optional" arguments as follows: 

• dft = : Routine uses default values for lsoda_tol, w_tol, dr. whi and wlo. 

• df t = 1 : Routine uses default values for dr, whi and wlo; user must supply values for 
lsoda_tol and w_tol. 

• dft = 2 : User must supply values for lsoda_tol, w_tol, dr, whi and wlo. 

The "optional" input parameters as well as their default values, are defined as follows: 

lsoda_tol: Isoda tolerance parameter. Isoda will attempt to keep the local error in the 
solution of the ODEs below this value. Default: lsoda_tol = lO^^*'. 

w_tol: Tolerance for computation of the eigenfrequency estimate, wshoot. The bisec- 
tion/shooting algorithm stops when |wiow — ^hi\ < w_tol. Default: w_tol = 10^^ 

dr : Solution output interval used by bsidpa when the algorithm is determining a bracket for 
the eigenvalue wshoot. It is useful to be able to set dr independently of the value An defined 
from rmax and n (see above) since if the estimate wshoot is sufficiently poor, Isoda may fail 
on the interval < i? < Ajj. In this case, setting dr ^ Aji may aid in the bracket- locating 
process. Default: dr = 10"''. 

*^Note, however, that since FORTRAN 77 has no provision for subprograms with varying-length argument lists, 
any invocation of bsidpa must have exactly 23 arguments 
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[whi,wlo] : Values defining the initial bracket for eigenfrcquency wshoot, and which should 
thus satisfy whi < wshoot < wlo. More speeifieally, for w — whi and wlo, respectively 
the solution (j)o{R) should tend to +00 and —00, respectively, for large R. bsidpa does 
have an algorithm encoded to automatically look for an appropriate bracket, but this must 
be used with caution. Automatic bracketing becomes particularly difficult in cases where 
eigenfrequencies corresponding to the ground state, and one or more of the excited states, 
are close to one another. That can happen, for example, when one or more of the potential 
coefficients, pi, are large. When performing parameter space surveys in such instances, it is 
recommended that a continuation method be used, whereby an initial bracket for a solution 
with 0o(O) + S(j)o{0) is given by [uj{(j)o{0)) — Slo, w(0o(O)) + 5uj], and where S(f)o{0) and 6uj 
are adjusted as necessary to ensure proper bracketing. Default: [whi, wlo] = [1.00,1.01] 
(appropriate for a mini boson star with m = 1 [pi = 0, P2 = 1) and (t)oiO) ~ 0.01). 

tail_type: Flag for controlling what closed form expression is used to define (f>o at large 
values of R, via a fitting procedure to the numerical solution of the ODEs. Currently, only 
tail_type = 1 is implemented, which results in a fit to 

Ti(i?) = Aexp(-Bi?), (A.2) 

where A and B are coefficients determined from the fit. 

Output Arguments 

bsidpa returns the following output arguments, all of which are vectors of length n: 
rCn) : The areal coordinate, R. 
aCn) : The computed radial metric function, a(R). 

alphaCn) : The computed lapse function, a{R), rescaled so that limij_j.oo o:{R) = 1. 

phi (n) : The computed scalar field modulus, ^o(^)- 

pp(n) : The computed derivative of the scalar field modulus, ^o{R). 

mCn) : The mass aspect function m(R) = R{1 — a{R)^^)/2. 

zr(n) : The "compactness function" z(R) = 2m{R) / R. This function provides a measure of 
how gravitationally compact a given star is. 

wresca: Rescaled value of w. 

Tab. lA.ll lists eigenfrequencies calculated using bsidpa for a sequence of values of the family 
parameter, 00(0), for the case U{(j)o) = (0o)^- Various properties of these stars are given in 
Tab. IA.2I These include the cutoff radius, i?cutoff , the derivative of m(R) at the cutoff, the ADM 
mass, A/adm, and the two measures of the stellar radius, Rgg and Rg^, that were defined in Sec. 13.41 
Note that the contribution to the total mass from the tail is negligible in all cases. Tab. lA. 3) lists 
additional properties including min^ «(_/?), max^ a(i?), maxjjz(i?), the radius, i?maxz, at which 
z{R) attains its maximum, and the Schwarzschild radius, i?s = 2AfADM, associated with each star. 
The values of max^ z{R) are particularly noteworthy, clearly showing that the overall gravitational 
field of the stars gets stronger as 00 (0) increases. 
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0o(O) 


wshoot 


wresca 


0.005 


1 


01173913657665 





987921656870952 


0.006 


1 


01414229214191 





985551416322374 


0.007 


1 


01656427949667 





983196152260810 


0.008 


1 


01900526873767 





980855788252950 


0.009 


1 


02146543309092 





978530270851457 


0.01 


1 


02394494622946 





976219519074950 


0.02 


1 


04984409445897 





953910295230624 


0.03 


1 


07788555219769 





933010708580807 


0.04 


1 


10828046126291 





913467522154004 


0.05 


1 


14126635619905 





895235350281201 


0.06 


1 


17711107464507 





878276194485022 


0.07 


1 


21611731750891 





862559036385476 


0.08 


1 


25862797105685 





848059515066958 


0.09 


1 


30503232180141 





834759601661149 



Table A.l: Central scalar field values and eigenfrequencies for mini boson stars (i.e. for boson 
stars with interaction potential U{(j)o) = (M^)- Listed are bare and rescaled values of the eigenfre- 
quency, wshoot and wresca, respectively, for a sequence of central scalar field values, 0o(O). The 
calculations were performed using rmax — 100, n = 216 + 1, dr = 10"^, and default values for all 
other parameters. 



00 (0) 


-Rcutoft 


dm/dR\i^^ 


utoff 


Madm 


-R99 


-R95 


0.005 


91 


12 


1.34x10 


-9 


0.2646 


35.78 


28.15 


0.006 


81 


42 


1.13x10 


-9 


0.2878 


32.58 


25.62 


0.007 


73 


58 


1.56x10 


-9 


0.3086 


30.09 


23.65 


0.008 


71 


63 


0.84x10 


-9 


0.3275 


28.08 


22.06 


0.009 


65 


72 


0.92x10 


-9 


0.3449 


26.40 


20.74 


0.01 


63 


85 


1.22x10 


-9 


0.3609 


24.99 


19.63 


0.02 


45 


49 


2.06x10 


-9 


0.4751 


17.26 


13.51 


0.03 


36 


62 


2.62x10 


-9 


0.5424 


13.77 


10.74 


0.04 


31 


99 


3.25x10 


-9 


0.5844 


11.67 


9.065 


0.05 


28 


39 


3.94x10 


-9 


0.6103 


10.22 


7.909 


0.06 


25 


44 


4.83x10 


-9 


0.6251 


9.148 


7.048 


0.07 


23 


74 


5.37x10 


-9 


0.6319 


8.311 


6.375 


0.08 


22 


16 


5.99x10 


-9 


0.6327 


7.637 


5.830 


0.09 


20 


84 


7.21x10 


-9 


0.6290 


7.081 


5.379 



Table A. 2: Mini Boson Star Properties: Tail, Mass and Radius. This table lists the location, 
-Rcutoft, at which (j)o{R) was fit to (|A.2p . dm/dR\ii^^^^tt, the ADM mass of the star, A/adm, and the 
arcal radii containing 99%Madm and 95%Madm. The computational parameters used to obtain 
this data arc described in the caption of Tab. lA.ll Note that the derivative of the mass aspect 
function at -Rcutoff is always extremely small, implying that the contribution to A/adm from the 
tail is negligible. 



167 



00 (0) 


min a(R) 


max'0(i?) 


max z{R) 


Rmax^ z 


Rs 


0.005 


0.9764 


1 


0111 


0.0207 


19.03 


0.529 


0.006 


0.9718 


1 


0134 


0.0248 


17.32 


0.575 


0.007 


0.9672 


1 


0157 


0.0288 


15.98 


0.617 


0.008 


0.9626 


1 


0180 


0.0328 


14.90 


0.655 


0.009 


0.9580 


1 


0203 


0.0367 


14.00 


0.690 


0.01 


0.9534 


1 


0226 


0.0406 


13.24 


0.722 


0.02 


0.9086 


1 


0454 


0.0780 


9.05 


0.950 


0.03 


0.8656 


1 


0678 


0.1124 


7.14 


1.085 


0.04 


0.8242 


1 


0898 


0.1441 


5.98 


1.169 


0.05 


0.7844 


1 


1114 


0.1734 


5.17 


1.220 


0.06 


0.7461 


1 


1325 


0.2004 


4.56 


1.250 


0.07 


0.7093 


1 


1532 


0.2253 


4.08 


1.264 


0.08 


0.6738 


1 


1735 


0.2483 


3.69 


1.265 


0.09 


0.6397 


1 


1933 


0.2694 


3.36 


1.258 



Table A. 3: Mini Boson Star Properties: Extrema of Metric Components. This table lists 
the minimum value of the lapse function, minQ;(i?), the maximum value of the conformal factor, 
ma,xtjj{R) and the maximum values of the compactness function maxz(i?); as well as the location, 
Rmnxz, at which z{R) attains its maximum. For comparison purposes, the Schwarzschild radius, 
Rs = 2Madm, associated with each star, is also tabulated. The computational parameters used to 
obtain this data are described in the caption of Tab. lA.ll 
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Appendix B 

Finite Difference Approximations 



The technique of finite differencing is the most common approach to the discretization of time 
dependent PDEs that has been used in numerical relativity to date. The specific finite difference 
approximations (PDAs) that we employ in this thesis are all straightforward, and the novice can 
consult any basic text on the subject, such as Mitchell and Griffiths [331, details on how they 
can be derived using, for example, Taylor series expansion. The main purpose of this appendix is to 
collect and explicitly display all of the FDAs that are used in the 3D finite difference code described 
in the body of the thesis. In addition, in Sec. IB. 21 we briefly describe a technique which is useful 
for deriving FDAs with good regularity properties when one is working in curvilinear coordinates 
(spherical polar, cylindrical etc.). 



B.l Finite Difference Operators 

We first recall (Sec. 14. ip that we adopt Cartesian (rectangular) coordinates for our computations, 
and that our spatial solution domain is then defined by 



•^min ^ 


X 




(B.l) 


Vmin — 


y 


— ymax5 


(B.2) 


Z < 


Z 


< Z 

— '-'max; 


(B.3) 



where Xmin, aJmax, J/min, i/max, ^min and Zmax are prescribed values. 

We discretize this domain by introducing a finite difference grid (or mesh), in which the same 
constant mesh spacing, h, is used in each of the three coordinate directions. The discrete spatial 
coordinates {xi,yj,Zk) that label the grid points are given by 



such that 





— *^min ^" (^ 


-i)h, 


i = 1 . . .n^r. 


(B.4) 




= 2/min + {j 


-i)h, 


j = l...ny, 


(B.5) 


Zk 


— ^min ~\~ {k 


-i)h, 


k = 1 . . .Hz, 


(B.6) 




3^1 — ^min 


and 




(B.7) 




2/1 = ymin 


and 


Vriy ymaxj 


(B.8) 




Z\ — ■2^min 


and 


^Tiz — ^max- 


(B.9) 



Here there is an implicit assumption that each of the ranges Xmax^^^min- J/max~ 2/min and Zniax~-Zmm 
is evenly divisible by h. 

Furthermore, assuming that the calculation is performed on the time interval Q <t < tmax, we 
similarly discretize the time coordinate by defining values, t", given by 

t" = (n-l)A/i, t=l...nt. (B.IO) 

Here, A — which is the ratio of the temporal and spatial mesh spacings — is known as the Courant 
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number. It will generally satisfy A < 1 from stability considerations, and will be held constant 
when we perform a sequence of computations in which h is varied (when we are performing a 
convergence test, for example). Again, there is a tacit assumption that Xh evenly divides imax- 

For a generic solution unknown, u(t, x, y, z), which is to be approximated on the grid, we then 
introduce the standard finite difference notation 



ul],k = u[f\xr,yj,Zk) , (B.ll) 

for the so-called grid-function values, u" ■ u. 

Having dispensed with basic definitions, we observe that the finite difference approximations 
that we have used fall into two sets. Members of the first set, which are listed in Tables IB.ll - 
IB.4[ were used in the actual discretization of the PDEs governing our model, and are all so- 
called centred approximations. The second set, enumerated in Tables IB.5IIB.12l were used to 
compute independent residuals (see Sec. I4.2.2p . and all of these are either forward or backward 
approximations. In all cases, the approximations are second order in the mesh spacing, h. This 
includes the FDA ()B.15|) for the first time derivative, du/dt, which is used in differencing the 
evolution equations (|2.173p and (|2.174p using a Crank-Nicholson scheme, and where the difference 
equations are centred at the "fictitious" grid point (t"+^/^, x^, z^) = (t" + \h/2,Xi,yj,Zk). 



2h 



u 



du 
dx 
du 



dy ' 2h 

du 
dz 

du 

'dt ' Xh 



i,j + l.k ^i,j-l,k 



2h 



(B.12) 
(B.13) 
(B.14) 
(B.15) 



Table B.l: Centred FDAs of First Order Derivatives. 



^tu 



,n+l 

h,j,k 



(B.16) 



Table B.2: Time Averaging Operator. 



o u 
dx"^ 
d'^u 
dy"^ 

d^u 
dz^ 



2u 



i,j,k 



'■i,3-l,k 



7;" — 2?;" 



(B.17) 
(B.18) 
(B.19) 



Table B.3: Centred FDAs of Second Order Derivatives. 
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^ "i-l,j + l,fc "i+l,j-l,fc ' "i+l,j + l,fc 

?i ^ "i.j-l.fe-l "i,j + l,/c-l "ij-l,fe+l ^ "i,j + l,fc+l 

M "i-l,j,/£-l "i+l,J,fe-l "i-lj-,fe + l + "j+lj-,fe + l 



dxdz Ah"^ 
Table B.4: Centred FDAs of Mixed Second Order Derivatives. 



du 
dx 
du 
dy 

du 







— 7;" 




2h 








_ 7;" 




2h 








— 7/" 


2h 







I- 7;" 




2h 








1" '^Lj-2,k 




2h 








^ ^i^i.fc-2 


2/i 



9?/' 







+ 47;" 


— 7;" 














+ 4<,+2,fc 


— 7/" 










2<.,fc 


— ^7/" 


+ 47;" 


— 7/" 



i9z2 /i^ 
Table B.7: Forward FDAs of Second Order Derivatives. 



(B.20) 
(B.21) 
(B.22) 



(B.23) 
(B.24) 



Table B.5: Forward FDAs of first order derivatives. 



_^ 1-3, k t-l,j,fc i-^,]:k /g 2g') 

.'?7," . — .4- 7/" _ , 

(B.27) 
(B.28) 



dy 

du 
dz 

Table B.6: Backward FDAs of First Order Derivatives. 



(B.30) 
(B.31) 
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dy 







/l2 






+ 4?;" — 












-1 + 4<,,fc_2 - <,,fc-3 



(B.32) 
(B.33) 



C U t,J,K ■l,J,K—± ■ ■l,J,K-Z ■l,J,K-0 OA\ 
^ ^^-^^^ 



Table B.8: Backward FDAs of Second Order Derivatives. 



9x9?/ 4/i2 

3?/" — 4?/" 4- 7/" 

+ ^^2 (^-^5) 



37/" — 47/" 4- 7/" 

+ 4^5 (^-^^^ 



dxdz Ah? 

37/" — 4//" + ?/" 

'^"i+2,j,fc ^"i+2,j,fc+l ' ^i+2.j,k+2 
4/l2 

Table B.9: Forward-Forward FDAs of Mixed Second Order Derivatives. 



—37/" +12?/" — Q?/" +47/" — Ifi?;" 



(B.37) 



i9a;i9y 4/i2 



I?//" —7/" -1-47/" —37/" 



4/l2 



9?/92 4ft,2 

197/" —7/" -1-47/" —37/" 

-^"^"ij-+l,/c "jj-+2,A;-2 ^"i,j+2,fc-l ■^"'i.j+2.k 

4^2 



(B.39) 



9a:9z 4ft,2 

197," —7/" -t- 47/" —37/" 

, -^^"i-l-l j".fe "i+2.j,/c-2 ^"j-|-2j".fe-l '^"'i+2,j.k 

+ i)^2 (B-40) 



Table B.IO: Forward-Backward FDAs of Mixed Second Order Derivatives. 
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+47;" —7/" -1-197/" — 1fi7/" 



4,;" — Q7;" -I- 197;" ~ "^7;" 

+ (B-41) 



+ ^2 (B-42) 



dxdz 4/1^ 

4/l2 

Table B.ll: Backward-Forward FDAs of Mixed Second Order Derivatives. 



(B.43) 



Ft'^n, 11" —4?;" +37;" —47/" +167/" —127/" 



+ 4^5 ^^-^^^ 



, 3<,,,_,-12<^,,_i+9<v, ^^^^^^ 



4/1^ 

3<,,.-2-12<,,.-i+9<,,, ^^^^^ 



4/i2 



Table B.12: Backward-Backward FDAs of Mixed Second Order Derivatives. 
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B.2 FDAs for Operators of the Form d/d{rP) 

In this section we briefly discuss a technique that is useful for deriving finite difference approxi- 
mations that yield discrete solutions with good regularity behaviour in the vicinity of coordinate 
singularities. We note that the method was introduced to the numerical relativity community by 
Evans in his PhD dissertation |155j . 

The technique is best illustrated by way of example. We thus consider the radial part of the 
Laplacian operator in spherical polar coordinates, {r,9,(j)). Assuming our unknown function, u, 
depends only on r, we then have 



1 d 



J.2 Qj, 



dr 



In order that the solution be regular (smooth) at the origin, r = 0, u{r) must satisfy 

lim u{r) = uq + U2r^ + 0{r'^) , 



(B.47) 



(B. 



where uq and U2 are constants. 

A key observation is that if we insert the above expansion into the right hand side of (|B.47p we 
find 



1 d 



, du 
dr 



J.2 Qj, 



d 



r-j-{uo + U2r^ + 0{r^)) 



{2u2r^ + 0{r^)) = 6u2 + 0(r=^) . 



(B.49) 



We now discrete the right hand side of (jB.47|) by first introducing a uniform radial grid. 



(j — l)h, j — 1,2, . . . Ur, where h is the mesh spacing, and then writing 

, du 



l_d_ 

^2 Qj. 



dr 



1 



1 {Uj + l - Uj) ~ Hi (Uj - Uj^l) 

2 -^2 



{rj+i/2-rj-i/2y 



(B.50) 



Here r 



J + l/2 



h/2 and 



"j-l/2 



h/2. The approximation (jB.50|) is second order and 



seems natural enough, especially if one requires that the expression involve only the values u 



and 



However, we now consider evaluating (|B.50p for u(r) given by truncating (|B.48|) at 



O(r^), i.e. for u{r) = uo 
elementary algebra that 



U2r , where, again, uq and U2 are constants. Then wc find after some 



= 6U2 



2 r2 



(B.51) 



This is not the correct leading order behaviour given by (|B.49|) . In particular, at the second grid 
point, r2 = h, this evaluates to (6^)m2, instead of 6u2, so that irrespective of how small wc make h, 
we will never capture the correct leading order behaviour of the differential operator in the vicinity 
of r = 0. Especially in dynamical evolutions, this can lead a lack of solution smoothness near the 
origin, and even to numerical instability. 

However, with a simple change of variables, and an application of the chain rule, we can rewrite 
the radial Laplacian operator as follows 
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which can then be discrctized as 



d 



, du 



d{r^) \ dr 



y,3 ^ ^3 



[Uj+l - Uj) 

(rj -rj-i) 



(Uj-Uj^l) 



0> ^2 ^ 



(B.53) 



When this finite difference approximation is evaluated for u{r) = uo + U2r'^ wc find 



3+i 



J 2 r. 



'3 '3 



^3-1, 



= 6M2 . 



(B.54) 



Thus, the correct leading order behaviour as r — is mirrored in the discretization through this 
change-of-variables trick. 

The generalization of this strategy is straightforward. When confronted with a differential 
expression of the general form 

df{r) 



dr 



where /(r) satisfies the regularity condition 



lim /(r) = CpvP + c„+2''' 



p+2 



for some integer p > 0, we rewrite (|B.55p as 



-1 df 



d{rP) 



(B.55) 
(B.56) 

(B.57) 



prior to finite differencing. 
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Appendix C 

PDEFDAOFF: A MAPLE Package for PDAS 



This appendix documents the use of the package PDEFDAOFF (an acronym for Partial Differen- 
tial Equation to Finite Difference Approximation using OFF-centred and centred approximation 
schemes). The package is a set of Mapl(|^ procedures we wrote to assist in the process of dis- 
cretizing the PDEs governing our model using finite difference methods. The package contains two 
key Maple procedures, df dandof f and resndof f which are discussed with illustrative examples in 
Secs. lC.l1 and lC.2l below. We note that the examples were extracted directly from the demonstration 
worksheet that is included in the distribution jl81| . 

C.l FDAs of n-dimensional Differential Operators 

df dandof f is a Maple procedure that computes a finite difference approximation to a differential 
operator, as applied to function of an arbitrary number of coordinates (a;^,a;^, . . . ,x"). Given a 
user-specified order of accuracy, p, the routine returns a finite difference approximation which is 
0{W) accurate in all coordinate directions. This procedure can calculate both centred and off- 
centred finite difference formula. 

The routine has a header given by 

dfdcindoff := proc (u :: function, x : : list (naime) , ud::name, j :: list (name) , 
h :: list (name) , q: : list (integer) , p:: integer, 
off : : list (integer) ) 

where the procedure arguments arc defined as follows: 

1. u: A generic Maple function of n variables. 

2. x: Length-?! list of independent variables (coordinates), in alphabetical order. 

3. ud: Maple name for the finite difference unknown. 

4. j : Length-?? list of names of indices corresponding to each coordinate. 

5. h: Length-r? list of coordinate spacings in each direction. 

6. q: Length-?? list specifying the order of the differential operator with respect to each coordi- 
nate direction. 

7. p: Approximation order for finite difference scheme 

8. off: Length-?? list of offsets (one of —1, or 1) which defines the off-centring of the approx- 
imation in each coordinate direction. 



° Maple is a registered trademark of Waterloo Maple Inc. 
*^df dandof f is an acronym for differential finite difference approximation for n-dimensional functions including 
off-centred schemes. 
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C.1.1 1-dimensional Examples 

We first define a Maple afias for u{x) wliicli suppresses tlie functional dependence on output, and 
allows us to omit the explicit dependence on input: 

> alias (u=u(x) ) ; 

u 

Our initial examples deal with first derivatives and illustrate the importance of proper spec- 
ification of the "off-centring" argument, off. Thus, if one naively attempts to find a first order 
centred scheme for du/dx, by using [0] as the value of off, the procedure executes as follows: 

> dfdandoffCu, [x] , ud, [i] , [hx] , [1], 1, [0]); 

Error, (in dfdandoff) invalid input: rhs received 1, which is not valid 
for its 1st argument, expr 

We do not consider this a bug since there is no 0{h) centred scheme for d/dx. Indeed, the simplest 
FDAs of d/dx arc the first order backward and forward approximations. Wc can generate the 
backward formula using off := [-1] : 

> dfdandoffCu, [x] , ud, [i] , [hx] , [1], 1, [-1]); 

d udi^i udi 

dx ' hx hx 

To verify the correctness of the finite difference operators that dfdajidof f computes — which 
means that we ensure that that approximations are 0{hP) accurate — we use another Maple proce- 
dure, f daeval, due to Choptuik and which is included in PDEFDAOFF. f daeval has the header 

fdaeval := proc(c: : array (integer) , sigma: : algebraic) 

with arguments defined as follows: 

1. c: A coefficient array such that c[j] is the relative weight of the unknown, Wi+j, in the finite 
difference approximation centred at Xi. 

2. sigma: An overall scale factor to be applied to the formula. 
The input to fdaeval thus defines an expression of the form 

Jmax jmax 

a ^ CjUi+j=a ^ Cju{x.i+jh) (C.l) 

where h is the (constant) mesh spacing that is hard-coded in the procedure, fdaeval then replaces 
all of the Ui+j = u(xi + jh) with Taylor series expansions (in h) about the value Ui = u{xi). 
Provided that (jC.ip does define a consistent approximation to some differential operator applied 
at a; = Xi, fdaeval will return that operator (acting on u, which again is hard coded into the 
routine), as well as the leading order truncation error terms. 

Thus, we can check the above invocation of dfdandoff as follows: 

> abwh := array (-1..0, [-1,1]); 

abwh := ARRAY{-l..Q, [0 = 1, -1 = -1]) 

> fdaevaKabwh, 1/h) ; 
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D (u) (x) - i D(2) (u) (x) /i + i D(3) (u) {x) 

This shows that, as expected, our backward formula is a valid first order finite difference approxi- 
mation of du/dx. 

Continuing, we can generate and check the 0{h?) centred, forward and backward approxima- 
tions of du/dx as follows: 

> # Centred approximation 

> dfdandoffCu, [x] , ud, [i] , [hx] , [1], 2, [0]); 

d 1 ttdi-i 1 udi^i 

> ach2 := array [-1,0,1]): 

> fdaeval(ach2, l/(2*h)); 

D {u) {x) -f i D(3) {u) {x) 

> # Backward approximation 

> dfdandoffCu, [x], ud,[i], [hx] , [1], 2, [-1]); 

d 1 2 '^'^'-1 ^ "'^i 

9.T '' 2 hx hx 2 hx 

> abwh2 := array(-2..0, [1,-4,3]): 

> f daeval(abwh2, l/(2*li)); 

D {u) (x) - i d(3) (u) (x) h^ + ^ d(4) (u) (x) h"^ 

> # Forward approximation 

> dfdandoffCu, [x] , ud, [i] , [hx] , [1], 2, [1]); 

d ^ _ 3 ud, ^ ^ ud.i+i 1 udi+2 
dx ^ 2 hx hx 2 hx 

> afwh2 := array(0..2, [-3,4,-1]): 

> fdaeval(afwh2, l/(2*h)); 

D {u) {x) - i d(3) {u) {x) h^-\ D^"^ (w) (2:) h^ 

We conclude this section with examples illustrating the computation of 0{h'^) accurate approx- 
imations for the second derivative, d'^u/dx'^ . 

> # Centred approximation 

> dfdandoffCu, [x] , ud, [i] , [hx] , [2], 4, [0]); 

1 Mrf,_2 4 ttdi-i 5 udi 4 udi^i 1 
9^""^ = "12 1^ + 3 ~ 2 ^ + 3 1^ " 12 1^ 

> a2ch4 := array (-2.. 2, [-1,16,-30,16,-1]): 

> fdaeval(a2ch4, 1/ (12*h**2) ) ; 

> # Forward approximation 

> dfdandoffCu, [x] , ud, [i] , [hx] , [2], 4, [2]); 

15 udi 77 wrfi+i 107 „ ''^rfi+s 61 udi+4^ 5 

a^"* " T X? " ¥ + ~ 1^ " + 12 ~ 6 

> a2fw2h4 := array(0..5, [45,-154,214,-156,61,-10]): 

> fdaeval(a2fw2h4, l/(12*h**2)) ; 

D(^) {u) [x) - |I D(6) (.) (^) _ g (.) (x) /.^ 
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C.1.2 3-dimensional Example 

The following example illustrates the use of df dandof f for functions with dependence on three 
independent variables. 

> alias(w=w(x,y,z)) ; 

w 

> # Second order forward approximation of d/dz 

> df dandof f(w, [x,y,z] , wd, [i,j,k], [hx,hy,liz] , [0,0,1], 2, [0,0,1]); 

dz ^'■'"'^ 2 hz hz 2 hz 

C.2 Residual Evaluators for n-dimensional PDEs 

resndof f IffI is a Maple procedure that automatically generates residuals evaluators associated with 
the finite-difference discretization of a general partial differential equation in n-dimensions. This 
routine was initially written with elliptic PDEs in mind, and in conjunction with the multigrid 
method, where the evaluation of residuals plays an important role when a relaxation technique is 
used as a smoother. However resndof f can also be employed in the context of time-dependent 
PDEs. In particular, it is very useful for generating independent residual evaluators for all types 
of PDEs, including time-dependent ones. As discussed in Chap. |4l independent residual evaluation 
is a very powerful tool for verifying the implementation of finite difference codes, especially when 
one can be confident that the independent residual evaluators are themselves error-free. Use of a 
routine such as resndof f helps ensure that this is the case, 
resndof f has the following header: 

resndof f := proc(eqn: : equation, u: : list (function) , x :: list (name) , 

ud: :list(ncmie) , j : : list (narnie) , h: : list (name) , p:: integer, 
of f : : list (integer) ) 

with procedure arguments defined as follows: 

1. eqn: Partial differential equation for which independent residual is to be computed. 

2. u: List of Maple names of the functions appearing in the PDE. 

3. x: List of independent variables (coordinates). 

4. ud: List of Maple names for the discrete representations of the functions (i.e. the grid function 
names). 

5. j: Length-n list of names of indices corresponding to each coordinate. 

6. h: Length-71 list of coordinate spacings in each direction. 

7. p: Approximation order for finite difference scheme. 

8. off: Length- ri list of offsets which defines the off-centring of the approximation in each 
coordinate direction. 

Its use is illustrated here for the case of a nonlinear Poisson equation in 3 dimensions, where 
compactified Cartesian coordinates are adopted: 



**resndoff is an acronym for residual evaluator for n-dimensional PDEs including off-centred finite difference 
schemes. 
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> # Define aliases for the functional dependence of u and f . 

> alias (u=u(chi , eta, zeta) , f=f (chi,eta,zeta)) ; 

u, f 

> # Define the PDE. 

> P0I3DCP := (l-chi**2)*diff ((l-chi**2)*diff (u,chi) ,chi)+ 

(l-eta**2) *dif f ( (l-eta**2) *dif f (u, eta) , eta) + 

(l-zeta**2)*dif f ((l-zeta**2)*dif f (u.zeta) ,zeta)+sigma*u**2=f ; 




> # Generate the residual 

> resndoff(P013DCP, [u,f], [chi, eta, zeta] , [ud,fd] , [i,j,k], [hx,hy,hz] , 

2, [0,0,0]); 



1 udi^ij,k 1 udi+i,j^k 

2 hx 2 hx 



+ (1 - X.') 



, 1 Mrfi.j-l.fc ^ 1 Udjj+l^k 

'^^l 2 hy +2 hy 



-2Cfe - 



1 ud 



1 ud 



hz 



i,j,k+l 

hz 



\ hx^ hx^ hx"^ 

V M J. 2 ^ , 2 + ,2 

udij.k udi.j,k+i 



1 - Ck' 



ud 



hz 



hz"^ hz"^ 
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Appendix D 

Review of Relaxation Methods 



In this appendix we provide a quick review of relaxation techniques as appUed to large sparse 
systems of algebraic equations (both linear and nonlinear). Relaxation methods are used in two 
distinct capacities in the 3D code that we describe in the main body of the thesis: 

1. In the case of the hyperbolic PDEs, point-wise Newton-Gauss-Scidel relaxation is used to solve 
the time-implicit algebraic equations that result from our use of an 0{h?) Crank-Nicholson 
scheme. 

2. For the elliptic PDEs, point-wise Ncwton-Gauss-Scidcl is used as a smoother in the context 
of the multigrid solution of the discrete equations, which again arise from an 0(h?) approxi- 
mation of the PDEs. 

The reader who is interested in full details concerning relaxation methods and related iterative 
techniques for the solution of large systems of equations should consult the classic reference by 
Varga [T68] . 

D.l The Linear Case: Jacobi and Gauss- Seidel Relaxation 

We write a general linear system of equations in the form 

Lu = f, (D.l) 

where L is a A'' x A'' matrix, and u and f are A^-component column vectors. In the typical case 
of interest, u will enumerate all of the discrete unknowns, u{t"',Xi,yj, Zk)-, associated with the 
finite difference approximation of some continuum function, u{t, x,y, z), at some specific discrete 
time, t = t"" . The size of the system will then be given hy N = rix x ny x riz, where n^, riy and 
Uz are the number of grid points in the x, y and z directions, respectively. Due to the locality 
of finite difference operators, the matrix L will generally be very sparse: that is, unless n^, riy 
and Uz are very small, most of its entries will be 0, and the fraction of non-zeros will in fact 
tend to as the mesh spacing h approaches 0. As is well known, even if one adopts a specially 
crafted ordering of the unknowns (see [164| for example), the solution of (|D.1[) using standard 
methods of numerical linear algebra — that is, using some form of Gaussian elimination such as LU 
decomposition [160j — is extremely inefficient. This is already the case for discretizations of problems 
in 2 spatial dimensions, and is even more so for 3D calculations. For example, consider the situation 
where the 3D Poisson equation in (a;, y, z) coordinates has been discretized on a uniform mesh with 
n = Tlx ~ Uy = Uz grid points per edge of the computational domain. Then the size of the system is 
= m?. We assume that we have used the standard 0[h?) 7-point approximation for the discrete 
Laplacian, which couples a particular unknown to its nearest neighbours in each of the three 
coordinate directions, and that we have numbered the unknowns in so called lexicographic order 
(((Mjjfc, z = 1, . . . n), j = 1, . . . n), fc = 1, . . . n). Then the bandwidth of L is O(n^) ~ 0(Af^/^), and 
the cost for solution via Gaussian elimination will be 0{N'^^^) in computational time and up to 
0{N^/'^) in memory. In practice this precludes direct solution of (|D.1[) for any but the smallest 3D 
problems. 

The class of iterative techniques known as relaxation methods were developed decades ago 
largely in response to the need to efficiently solve the large systems of algebraic equations arising 
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from the finitc-diffcrcncc discretization of PDEs, particularly those of elliptic type. Relaxation 
methods directly exploit the sparsity structure of these systems, and have the further advantage of 
having extremely economical memory requirements, since the system of equations is not explicitly 
stored. 

We thus start with the general notion of an iterative, or fixed point technique for the solution 
of (jP.ip . We note that the system can be rewritten as 

u = Tu + c, (D.2) 

where T is the N x N update matrix and c is a constant vector that depends on the specific iterative 
method used. The idea is to choose T and c such that starting from some initial estimate, u^°) , of 
u, the iteration 

^(fc+i) ^ Tu(fc) ^ J, ^ (D.3) 
has a fixed point u which satisfies (|D.1|) . That is, we want 

lim u^'^') = u . (D.4) 

k^oo 

Whether or not the iteration (|D.3[) converges is determined by the eigenvalues of the update matrix 
T. More precisely, the necessary and sufficient condition for convergence is that the spectral radius, 
p(T), of T satisfy p{T) < 1. We recall that the spectral radius may be defined as 

p{T) = max I A, I, (D.5) 

l<i<Af 

where Xi are the eigenvalues of T. For strictly diagonally dominant matrices T, which satisfy 

N 



two iterative schemes become particularly interesting since they are known to converge for any 
initial estimate u'-^'. This observation is pertinent to the case of equations resulting from FDA 
approximation since such systems often satisfy some form of diagonal dominance, which although 
frequently weaker than (jD.6|) . is still enough to ensure convergence of the methods we will outline. 

Although the formal analysis of relaxation methods (again, see jl68j ) typically proceeds via a 
matrix approach which involves a decomposition, or splitting, of L, for our purposes it is more 
intuitive to describe the techniques as they are typically implemented as computer code. We thus 
write (|D.ip in the form 

N 

Y,U,u,=f,, i= l,2,...,iV, (D.7) 

which amounts to explicitly writing out the action of each row of L on u, and equating it to the 
corresponding element of f . We then view any of the N subsystems of equations defined by (jD.7p 
as an equation for the single unknown, Ui. Solving for Ui we get 



1 

Ui = — 



N 



(D.; 



and the sequence of approximations starting from the initial guess u^^^ can be generated component 
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by component using 



(k) 
u, = 



La 



N 



1 = 1,2,. ..,N. 



(D.9) 



The method defined by (jP.Qp is known as Jacobi iteration. We note that it requires that we 
store values for both the current approximation, u^*^', as well as the previous estimate, u^'^"^^. 
Crucially, however, for systems derived from FDAs there is no need to explicitly store the elements 
of L. Rather, the computation of the right hand side of (|D.9|) simply amounts to the evaluation of 
the finite difference formula at a given grid point, so that the components Lij are effectively "hard 
coded" into the computer program. Thus, the memory requirements for the technique — and for all 
relaxation methods — are the optimal 0{N). We also note at this point that the complete process 
of iterating the solution estimate from u^*^^ to u^'^+^' is frequently called a relaxation sweep, in 
appeal to the notion that one "sweeps" through the finite difference grid, updating one discrete 
unknown at a time. 

The other relaxation technique we wish to consider can be viewed as a simple modification of the 
Jacobi iteration that aims to speed convergence by using the most recently computed components 
of the solution estimate when they are available. So, assuming that we are updating the unknowns 
Mi^*''-' in the order i = 1,2, . . . , N , then for j < i we will have already computed Uj^''\ while for 



j > i we will need to use values u 



We thus modify (|D.9[) as follows: 



1 

La 



N 

E 



(fe-i) 



1,2,, 



(D.IO) 



This defines the Gauss-Seidel iteration for the solution of (jD.l[) . In practice, Gauss-Seidel relaxation 
doe.s tend to require fewer sweeps than Jacobi to achieve convergence to a specified tolerance. 
Additionally, Gauss-Seidel iteration only needs storage for a single instance of the unknown vector, 
which is half of that required by the Jacobi method. 

We should also emphasize at this juncture that the methods given by (|D.9[) and (jP.lOp can be 
further qualified as defining point-wise Jacobi and point-wise Gauss-Seidel relaxations, respectively. 
This distinguishes the techniques from extensions of the methods in which groups of unknowns — 
u{xi,yj, Zk), i = 1,..., N, for fixed j and k, for example — are updated simultaneously. These 
generalizations are known as line relaxation methods, and although we have mentioned them briefly 
in Sec. 12.4.51 we did not use them in any of the work described in this thesis. 

In preparation for our discussion of nonlinear systems in the next section it is useful to describe 
the point-wise Gauss-Seidel iteration (jP.lOl) in terms of the residual vector associated with the 
approximate solution. First, given the approximation u*-*^^ that is defined after the completion of 
the fc-th relaxation sweep, we define the residual vector r^'^^ by 



rW^LuW-f, 



(D.ll) 



so that, assuming that the Gauss-Seidel iteration converges, we have 

lim r^'') = , 



(D.12) 



where is the A^-component 0-vector. We again emphasize that we can only compute r*^ after the 
fc-th pass through the solution unknowns has been completed. We therefore further introduce the 
concept of a running residual, generically denoted r, which has components that are continually 



*®It should be noted, however, that both Gauss-Seidel and Jacobi generally have the same asymptotic convergence 
rate [168) . 
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changing as individual unknowns arc updated. Wc thus define the (running) unknown vector u^^'^^ 
which at any iteration fc, has the foUowing "instantaneous" values when we have just updated 
unknown z — 1 and are about to update unknown i: 



~[k) 



, . . . , aj^ 



With this definition, the running residual, ri*-'^-', is given by 



(D.13) 



(D.14) 



where the subscript i reminds us that r^^'^) has components that arc constantly changing as we 
sweep through the grid, updating unknowns one by one. Using (jP.lOp and (jD.14p , we can explicitly 
write the components, [rj^'^^Jm, of the running residual vector as 



^ I Jm — ^ ^ 



(fe) 



N 



(fc-i) 



,(fe-l) 



(D.15) 



For the particular case of the i-th residual this last expression gives us 

N 



i-l 



(k-i) „ 



(D.16) 



which upon insertion into (|D.10p yields a more compact form of the point-wise Gauss-Seidel itera- 
tion: 



(fe) (fe-i) 



(D.17) 



This expression will be generalized to the case of nonlinear relaxation in the next section. 



D.2 The Nonlinear Case: Newton-Gauss-Seidel Relaxation 

In situations such as ours, where the governing PDEs are nonlinear, finite difference approximation 
will naturally lead to large sparse, systems of nonlinear algebraic equations. Newton's method p,60] 
then provides a route to extend the techniques described above so that they can be used to solve 
nonlinear systems. 

We begin then by reviewing Newton's method for the solution of single nonlinear equation in a 
single unknown, u. We write the equation in the canonical form 

/(u)=0, (D.18) 

and note that the technique is itself iterative. Thus we must start with some initial estimate, ul^l, 
of the solution, and then generate iterates, ul"!, such that, assuming that the method converges 
(which generally is dependent on the quality of the initial guess), we have lim„_i.oo ~ u. The 
Newton iteration for (|D.18[) is then given by 



u 



[n+l] 



(D.19) 



/' (tiN) ' 

where /'(u) = df{u)/du. Defining the residual, r^"!, associated with the iterate, m'"', as 

r["l EE / (uN j ^ (D.20) 
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we can rewrite (|D.19p as 



^+l] 



(D.21) 



We now assume that the system of N equations that resuh from the finite difference discretiza- 
tion of our PDEs has been written in the canonical form 

F,[u]=0, i^l,2,...,N, (D.22) 

where each of the Fi, i = 1, N is a. nonlinear function of the unknowns, u = [ui,U2, ■ ■ ■ , un]"^ 
(but only a few of them in general, again due to the locality of finite difference operators). 

The application of relaxation to nonlinear equations again involves relaxation sweeps through 
the unknowns, so focusing on the extension of the Gauss-Seidel method, and in analogy to what 
we did in the previous section, we define the components, [r^'^'^^Jm, of the running residual vector 

by 

[f,W]„ = F„[u,W]. (D.23) 

Here, the running solution vector, Ui^^\ is defined by (jD.13|) as previously, and the subscript i 
again emphasizes that the elements of the running vectors are constantly changing as we sweep 
through the grid, updating each unknown in turn. 

With this definition, and using (jD.21l) . the (one-step) point-wise Newton-Gauss-Seidel relax- 
ation method is defined by 

«,W=,X'=-i)-l£l|!lll, (D.24) 
where the "diagonal Jacobian element" , Ja is given by 

dF, [u 



'J a — 



dui 



(D.25) 

=ti,-<'=-i) 



The additional nomenclature "one step" indicates that we do not use Newton's method to fully 
solve any of the individual nonlinear equations as we visit the unknowns; instead, we apply only one 
iteration of the Newton technique before moving on to the next unknown. This strategy is largely 
motivated by the observation that a complete solution of any individual equation will generally 
represent some amount of wasted computational work, since as soon as one of the neighbouring 
unknowns is modified by the subsequent relaxation process, the equation will no longer be satisfied. 



